GENE REGULATION IN TRANSGENIC ANIMALS USING A TRANSPOSON-BASED VECTOR

The U.S. Government has certain rights in this invention. The development of
this invention was partially funded by the United States Government under a HATCH
grant from the United States Department of Agriculture, partially funded by the
United States Government with Formula 1433 funds from the United States
Department of Agriculture and partially funded by the United States Government
under contract DAAD 19-02016 awarded by the Army.

FIELD OF THE INVENTION 

The present invention relates generally to cell-specific gene regulation in
transgenic animals. Animals may be made transgenic through administration of a
transposon-based vector through any method of administration including pronuclear
injection, or intraembryonic, intratesticular, intraoviductal or intravenous
administration. These transgenic animals contain the gene of interest in all cells,
including germ cells. Animals may also be made transgenic by targeting specific cells
for uptake and gene incorporation of the transposon-based vectors. Stable
incorporation of a gene of interest into cells of the transgenic animals is demonstrated
by expression of the gene of interest in a cell, wherein expression is regulated by a
promoter sequence. The promoter sequence may be provided as a transgene along
with the gene of interest or may be endogenous to the cell. The promoter sequence
may be constitutive or inducible, wherein inducible promoters include tissue-specific
promoters, developmentally regulated promoters and chemically inducible promoters.

BACKGROUND OF THE INVENTION

Transgenic animals are desirable for a variety of reasons, including their
potential as biological factories to produce desired molecules for pharmaceutical,
diagnostic and industrial uses. This potential is attractive to the industry due to the
inadequate capacity in facilities used for recombinant production of desired molecules
and the increasing demand by the pharmaceutical industry for use of these facilities.
Numerous attempts to produce transgenic animals have met several problems,
including low rates of gene incorporation and unstable gene incorporation.
Accordingly, improved gene technologies are needed for the development of
transgenic animals for the production of desired molecules.

Improved gene delivery technologies are also needed for the treatment of
disease in animals and humans. Many diseases and conditions can be treated with
gene-delivery technologies, which provide a gene of interest to a patient suffering
from the disease or the condition. An example of such a disease is Type 1 diabetes.
Type 1 diabetes is an autoimmune disease that ultimately results in destruction of the
insulin-producing β-cells in the pancreas. Although patients with Type 1 diabetes
may be treated adequately with insulin injections or insulin pumps, these therapies are
only partially effective. Insulin replacement, such as via insulin injection or pump
administration, cannot fully reverse the defect in the vascular endothelium found in
the hyperglycemic state (Pieper et al., 1996. Diabetes Res. Clin. Pract. Suppl. S157-
S162). In addition, hyper- and hypoglycemia occur frequently despite intensive
home blood glucose monitoring. Finally, careful dietary constraints are needed to
maintain an adequate ratio of calories consumed. This often causes major
psychosocial stress for many diabetic patients. Development of gene therapies
providing delivery of the insulin gene into the pancreas of diabetic patients could
overcome many of these problems and result in improved life expectancy and quality
of life.

Several of the prior art gene delivery technologies employed viruses that are
associated with potentially undesirable side effects and safety concerns. The majority
of current gene-delivery technologies useful for gene therapy rely on virus-based
delivery vectors, such as adeno- and adeno-associated viruses, retroviruses, and other
viruses, which have been attenuated to no longer replicate (Kay, M.A., et al. 2001.
Nature Medicine 7:33-40).

There are multiple problems associated with the use of viral vectors. First,
they are not tissue-specific. In fact, a gene therapy trial using adenovirus was recently
halted because the vector was present in the patient's sperm (Gene trial to proceed
despite fears that therapy could change child's genetic makeup, The New York
Times, December 23, 2001). Second, viral vectors are likely to be transiently
incorporated, which necessitates re-treating a patient at specified time intervals (Kay,
M.A., et al. 2001. Nature Medicine 7:33-40). Third, there is a concern that a viral-
based vector could revert to its virulent form and cause disease. Fourth, viral-based
vectors require a dividing cell for stable integration. Fifth, viral-based vectors
indiscriminately integrate into various cells and tissues, which can result in
undesirable germline integration. Sixth, the required high titers needed to achieve the
desired effect have resulted in the death of one patient and they are believed to be
responsible for induction of cancer in a separate study (Science, News of the Week,
October 4, 2002).

Accordingly, what is needed is a new vector to produce transgenic animals
and humans with stably incorporated genes, which vector does not cause disease or
other unwanted side effects. There is also a need for DNA constructs that would be
stably incorporated into the tissues and cells of animals and humans, including cells in
the resting state, which are not replicating. There is a further recognized need in the
art for DNA constructs capable of delivering genes to specific tissues and cells of
animals and humans.

When incorporating a gene of interest into an animal for the production of a
desired protein or when incorporating a gene of interest in an animal or human for the
treatment of a disease, it is often desirable to selectively activate incorporated genes
using inducible promoters. These inducible promoters are regulated by substances
either produced or recognized by the transcription control elements within the cell in
which the gene is incorporated. In many instances, control of gene expression is
desired in transgenic animals or humans so that incorporated genes are selectively
activated at desired times and/or under the influence of specific substances.
Accordingly, what is needed is a means to selectively activate genes introduced into
the genome of cells of a transgenic animal or human. This can be taken a step further
to cause incorporation to be tissue-specific, which prevents widespread gene
incorporation throughout a patient's body (animal or human). This decreases the
amount of DNA needed for a treatment, decreases the chance of incorporation in
gametes, and targets gene delivery, incorporation, and expression to the desired tissue
where the gene is needed to function.


SUMMARY OF THE INVENTION 

The present invention addresses the problems described above by providing
new, effective and efficient compositions for producing transgenic animals and for
treating disease in animals or humans. Transgenic animals include all egg-laying
animals and milk-producing animals. Transgenic animals further include but are not
limited to avians, fish, amphibians, reptiles, insects, mammals and humans. In a
preferred embodiment, the animal is an avian animal. In another preferred
embodiment, the animal is a milk-producing animal, including but not limited to
bovine, porcine, ovine and equine animals. Animals are made transgenic through
administration of a composition comprising a transposon-based vector designed for
stable incorporation of a gene of interest for production of a desired protein, together
with an acceptable carrier. A transfection reagent is optionally added to the
composition before administration.

The transposon-based vectors of the present invention include a transposase,
operably-linked to a first promoter, and a coding sequence for a protein or peptide of
interest operably-linked to a second promoter, wherein the coding sequence for the
protein or peptide of interest and its operably-linked promoter are flanked by
transposase insertion sequences recognized by the transposase. The transposon-based
vector also includes the following characteristics: a) one or more modified Kozak
sequences comprising ACCATG (SEQ ID NO:13) at the 3' end of the first promoter
to enhance expression of the transposase; b) modifications of the codons for the first
several N-terminal amino acids of the transposase, wherein the nucleotide at the third
base position of each codon was changed to an A or a T without changing the
corresponding amino acid; c) addition of one or more stop codons to enhance the
termination of transposase synthesis; and/or d) addition of an effective polyA
sequence operably-linked to the transposase to further enhance expression of the
transposase gene.
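
The codon change described in item b) can be illustrated with a short sketch. The
Python below is a minimal illustration, not the procedure actually used to build the
vectors: it walks the first few codons of a coding sequence and replaces a G or C in
the third position with an A or a T whenever the standard genetic code gives the same
amino acid. The function names, the preference for A over T, the default of 20
codons, and the example sequence are all assumptions made for illustration; the
example sequence is not the transposase sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence (length a multiple of 3) to one-letter codes."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def at_enrich_third_base(cds, n_codons=20):
    """Return cds with synonymous A/T third-base swaps in the first n_codons codons.

    Hypothetical helper: codons with no synonymous A- or T-ending
    alternative (for example ATG or TGG) are left unchanged.
    """
    cds = cds.upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    for idx, codon in enumerate(codons[:n_codons]):
        if len(codon) < 3 or codon[2] in "AT":
            continue  # incomplete trailing codon, or already A/T at the third base
        for base in "AT":  # assumption: try A first, then T
            candidate = codon[:2] + base
            if CODON_TABLE[candidate] == CODON_TABLE[codon]:
                codons[idx] = candidate
                break
    return "".join(codons)


# Made-up example sequence, not the transposase gene.
original = "ATGCTGGCCTGGAAC"
modified = at_enrich_third_base(original)
print(original, "->", modified)
print("same protein:", translate(original) == translate(modified))
```

In this sketch the start codon ATG and the tryptophan codon TGG stay as they are,
because neither has a synonymous codon ending in A or T.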

Use of the compositions of the present invention results in highly efficient and
stable incorporation of a gene of interest into the genome of transfected animals. For
example, transgenic avians have been mated and produce transgenic progeny in the
G1 generation. The transgenic progeny have been mated and produce transgenic
progeny in the G2 generation.

The present invention also provides for tissue-specific incorporation and/or
expression of a gene of interest. Tissue-specific incorporation of a gene of interest
may be achieved by placing the transposase gene under the control of a tissue-specific
promoter, whereas tissue-specific expression of a gene of interest may be achieved by
placing the gene of interest under the control of a tissue-specific promoter. In some
embodiments, the gene of interest is transcribed under the influence of an ovalbumin,
or other oviduct-specific, promoter. Linking the gene of interest to an oviduct-specific
promoter in an egg-laying animal results in synthesis of a desired molecule and
deposition of the desired molecule in a developing egg. The present invention further
provides for stable incorporation and expression of genes in the epithelial cells of the
mammary gland in milk-producing animals. Transcription of the gene of interest in
the epithelial cells of the mammary gland results in synthesis of a desired molecule
and deposition of the desired molecule in the milk. A preferred molecule is a protein.
In some embodiments, the desired molecule deposited in the milk is an antiviral
protein, an antibody, or a serum protein.

In other embodiments, specific incorporation of the proinsulin gene into liver
cells of a diabetic animal results in the improvement of the animal's condition. Such
improvement is achieved by placing a transposase gene under the control of a liver-
specific promoter, which drives integration of the gene of interest in liver cells of the
diabetic animal.

The present invention advantageously produces a high number of transgenic
animals having a gene of interest stably incorporated. These transgenic animals
successfully pass the desired gene to their progeny. The transgenic animals of the
present invention also produce large amounts of a desired molecule encoded by the
transgene. Transgenic egg-laying animals, particularly avians, produce large amounts
of a desired protein that is deposited in the egg for rapid harvest and purification.
Transgenic milk-producing animals produce large amounts of a desired protein that is
deposited in the milk for rapid harvest and purification.

Any desired gene may be incorporated into the novel transposon-based vectors
of the present invention in order to synthesize a desired molecule in the transgenic
animals. Proteins, peptides and nucleic acids are preferred desired molecules to be
produced by the transgenic animals of the present invention. Particularly preferred
proteins are antibody proteins.

This invention provides a composition useful for the production of transgenic
hens capable of producing substantially high amounts of a desired protein or peptide.
Entire flocks of transgenic birds may be developed very quickly in order to produce
industrial amounts of desired molecules. The present invention solves the problems
inherent in the inadequate capacity of fermentation facilities used for bacterial
production of molecules and provides a more efficient and economical way to
produce desired molecules. Accordingly, the present invention provides a means to
produce large amounts of therapeutic, diagnostic and reagent molecules.

Transgenic chickens are excellent in terms of convenience and efficiency of
manufacturing molecules such as proteins and peptides. Starting with a single
transgenic rooster, thousands of transgenic offspring can be produced within a year.
(In principle, up to forty million offspring could be produced in just three
generations.) Each transgenic female is expected to lay at least 250 eggs/year, each
potentially containing hundreds of milligrams of the selected protein. Flocks of
chickens numbering in the hundreds of thousands are readily handled through
established commercial systems. The technologies for obtaining eggs and
fractionating them are also well known and widely accepted. Thus, for each
therapeutic, diagnostic, or other protein of interest, large amounts of a substantially
pure material can be produced at relatively low incremental cost.
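
The production figures in this paragraph can be checked with simple arithmetic. The
Python sketch below assumes 100 mg of protein per egg (the low end of "hundreds of
milligrams") and a flock of 100,000 hens (the low end of "hundreds of thousands");
both values, and the function name, are illustrative assumptions rather than
measured results.

```python
def annual_protein_grams(hens, eggs_per_hen_per_year=250, mg_protein_per_egg=100.0):
    """Grams of protein deposited in eggs per year under the stated assumptions."""
    return hens * eggs_per_hen_per_year * mg_protein_per_egg / 1000.0


per_hen_g = annual_protein_grams(1)                 # one hen
flock_kg = annual_protein_grams(100_000) / 1000.0   # assumed flock of 100,000 hens

# Average fold increase per generation implied by reaching forty million
# animals in three generations from a single founder.
implied_fold_increase = 40_000_000 ** (1 / 3)

print(f"{per_hen_g:.0f} g per hen per year")
print(f"{flock_kg:.0f} kg per flock per year")
print(f"about {implied_fold_increase:.0f}-fold increase per generation")
```

Under these assumptions a single hen would deposit about 25 g of protein per year
and the assumed flock about 2,500 kg per year.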

A wide range of recombinant peptides and proteins can be produced in
transgenic egg-laying animals and milk-producing animals. Enzymes, hormones,
antibodies, growth factors, serum proteins, commodity proteins, biological response
modifiers, peptides and designed proteins may all be made through practice of the
present invention. For example, rough estimates suggest that it is possible to produce
in bulk growth hormone, insulin, or Factor VIII, and deposit them in transgenic egg
whites, for an incremental cost in the order of one dollar per gram. At such prices it
is feasible to consider administering such medical agents by inhalation or even orally,
instead of through injection. Even if bioavailability rates through these avenues were
low, the cost of a much higher effective dose would not be prohibitive.
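
The cost argument can be made concrete with a small calculation. The Python sketch
below takes the incremental cost of about one dollar per gram from the text; the 1 mg
absorbed dose and the 1% bioavailability are purely hypothetical values chosen for
illustration, as is the function name.

```python
def cost_per_dose_usd(absorbed_dose_mg, bioavailability, cost_per_gram_usd=1.0):
    """Cost of the administered amount needed to deliver absorbed_dose_mg.

    bioavailability is the fraction (0 < f <= 1) of the administered
    protein that is actually absorbed.
    """
    if not 0 < bioavailability <= 1:
        raise ValueError("bioavailability must be in (0, 1]")
    administered_mg = absorbed_dose_mg / bioavailability
    return administered_mg / 1000.0 * cost_per_gram_usd


# Hypothetical case: 1 mg must be absorbed; injection delivers all of it,
# while only 1% of an oral or inhaled dose is absorbed.
injected = cost_per_dose_usd(1.0, 1.0)
oral = cost_per_dose_usd(1.0, 0.01)
print(f"injected: ${injected:.3f} per dose, oral: ${oral:.2f} per dose")
```

Even with a hundredfold larger administered dose, the material cost in this
hypothetical case is about ten cents per dose.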

In one embodiment, the egg-laying transgenic animal is an avian. The method
of the present invention may be used in avians including Ratites, Psittaciformes,
Falconiformes, Piciformes, Strigiformes, Passeriformes, Coraciformes, Ralliformes,
Cuculiformes, Columbiformes, Galliformes, Anseriformes, and Herodiones.
Preferably, the egg-laying transgenic animal is a poultry bird. More preferably, the
bird is a chicken, turkey, duck, goose or quail. Another preferred bird is a ratite, such
as an emu, an ostrich, a rhea, or a cassowary. Other preferred birds are partridge,
pheasant, kiwi, parrot, parakeet, macaw, falcon, eagle, hawk, pigeon, cockatoo, song
birds, jay bird, blackbird, finch, warbler, canary, toucan, mynah, or sparrow.

In another embodiment, the transgenic animal is a milk-producing animal,
including but not limited to bovine, ovine, porcine, equine, and primate animals.
Milk-producing animals include but are not limited to cows, goats, horses, pigs,
buffalo, rabbits, non-human primates, and humans.

Accordingly, it is an object of the present invention to provide novel
transposon-based vectors.

It is another object of the present invention to provide novel transposon-based
vectors that encode for the production of desired proteins or peptides in cells.

It is an object of the present invention to produce transgenic animals through
administration of a transposon-based vector.

Another object of the present invention is to produce transgenic animals
through administration of a transposon-based vector, wherein the transgenic animals
produce desired proteins or peptides.

Yet another object of the present invention is to produce transgenic animals
through administration of a transposon-based vector, wherein the transgenic animals
produce desired proteins or peptides and deposit the proteins or peptides in eggs or
milk.

It is a further object of the present invention to produce transgenic animals
through intraembryonic, intratesticular or intraoviductal administration of a
transposon-based vector.

It is further an object of the present invention to provide a method to produce
transgenic animals through administration of a transposon-based vector that are
capable of producing transgenic progeny.

Yet another object of the present invention is to provide a method to produce
transgenic animals through administration of a transposon-based vector that are
capable of producing a desired molecule, such as a protein, peptide or nucleic acid.

Another object of the present invention is to provide a method to produce
transgenic animals through administration of a transposon-based vector, wherein such
administration results in modulation of endogenous gene expression.

It is another object of the present invention to provide transposon-based vectors
useful for cell- or tissue-specific expression of a gene of interest in an animal or
human with the purpose of gene therapy.

It is yet another object of the present invention to provide a method to produce
transgenic avians through administration of a transposon-based vector that are capable
of producing proteins, peptides or nucleic acids.

It is another object of the present invention to produce transgenic animals
through administration of a transposon-based vector encoding an antibody or a
fragment thereof.

Still another object of the present invention is to provide a method to produce
transgenic avians through administration of a transposon-based vector that are capable
of producing proteins or peptides and depositing these proteins or peptides in the egg.

Another object of the present invention is to provide transgenic avians that
contain a stably incorporated transgene.

Still another object of the present invention is to provide eggs containing
desired proteins or peptides encoded by a transgene incorporated into the transgenic
avian that produces the egg.

It is further an object of the present invention to provide a method to produce
transgenic milk-producing animals through administration of a transposon-based
vector that are capable of producing proteins, peptides or nucleic acids.

Still another object of the present invention is to provide a method to produce
transgenic milk-producing animals through administration of a transposon-based
vector that are capable of producing proteins or peptides and depositing these proteins
or peptides in their milk.

Another object of the present invention is to provide transgenic milk- 
producing animals that contain a stably incorporated transgene. 

Another object of the present invention is to provide transgenic milk-
producing animals that are capable of producing proteins or peptides and depositing
these proteins or peptides in their milk.

Yet another object of the present invention is to provide milk containing
desired molecules encoded by a transgene incorporated into the transgenic milk-
producing animals that produce the milk.

Still another object of the present invention is to provide milk containing
desired proteins or peptides encoded by a transgene incorporated into the transgenic
milk-producing animals that produce the milk.

A further object of the present invention is to provide a method to produce
transgenic sperm through administration of a transposon-based vector to an animal.

A further object of the present invention is to provide transgenic sperm that
contain a stably incorporated transgene.

An advantage of the present invention is that transgenic animals are produced 
with higher efficiencies than observed in the prior art. 

Another advantage of the present invention is that these transgenic animals
possess high copy numbers of the transgene.

Another advantage of the present invention is that the transgenic animals
produce large amounts of desired molecules encoded by the transgene.

Still another advantage of the present invention is that desired molecules are
produced by the transgenic animals much more efficiently and economically than prior
art methods, thereby providing a means for large scale production of desired molecules,
particularly proteins and peptides.



According to the invention there is also provided a vector comprising:

a) a modified transposase gene operably-linked to a first promoter, wherein the
nucleic acid sequence 3' to the first promoter comprises the sequence as set forth in
SEQ ID NO:13, wherein SEQ ID NO:13 contains the Kozak sequence and a start codon
for the transposase, and wherein at least one of the first twenty codons for the
transposase gene is modified from the wild-type sequence by changing a nucleotide at
a third base position of the codon to an adenine or thymine without modifying the
amino acid encoded by the codon, and

b) one or more genes of interest operably-linked to one or more additional
promoters; and wherein the one or more genes of interest and their operably-linked
promoters are flanked by transposase insertion sequences recognized by the transposase
encoded by the modified transposase gene.
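
The arrangement recited in a) and b) can be pictured as a simple data structure. The
Python sketch below is only an illustration: the class and field names, the
left-to-right ordering of elements, and the placeholder sequences are assumptions
and do not come from the specification, which defines the vectors by their sequence
listings. It records which elements sit between the insertion sequences (IS) and
locates the start codon inside the ACCATG motif (SEQ ID NO:13) downstream of the
first promoter.

```python
from dataclasses import dataclass
from typing import List, Optional

KOZAK_WITH_START = "ACCATG"  # SEQ ID NO:13 as quoted in the text


@dataclass
class ExpressionUnit:
    promoter: str             # promoter name (placeholder label)
    downstream_sequence: str  # sequence immediately 3' to the promoter


@dataclass
class TransposonVector:
    transposase: ExpressionUnit              # driven by the first promoter
    genes_of_interest: List[ExpressionUnit]  # each with its own promoter

    def layout(self) -> List[str]:
        """Element order; the ordering itself is an illustrative assumption."""
        parts = [f"Pro:{self.transposase.promoter}", "transposase", "IS"]
        for unit in self.genes_of_interest:
            parts += [f"Pro:{unit.promoter}", "gene of interest"]
        parts.append("IS")
        return parts

    def start_codon_index(self) -> Optional[int]:
        """Index of the ATG inside the first ACCATG match, or None if absent."""
        pos = self.transposase.downstream_sequence.upper().find(KOZAK_WITH_START)
        return None if pos < 0 else pos + 3


# Placeholder names and sequences, not taken from the sequence listing.
vector = TransposonVector(
    transposase=ExpressionUnit("first promoter", "ACCATGTGTGAA"),
    genes_of_interest=[ExpressionUnit("second promoter", "ATGGCA")],
)
print(" - ".join(vector.layout()))
print(vector.start_codon_index())
```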

According to the invention there is also provided a method of producing a 
transgenic animal comprising administering to the animal a vector according to the 
invention. 

According to the invention there is also provided an egg produced by the
transgenic avian animal according to the invention, wherein the egg contains one or
more desired proteins encoded by the one or more genes of interest.

According to the invention there is also provided a transgenic sperm produced 
by the transgenic animal produced according to the invention. 

According to the invention there is also provided a method for producing a
desired protein comprising:

a) administering to the animal a vector according to the invention; and

b) isolating the desired protein produced in the animal.

Throughout this specification the word "comprise", or variations such as
"comprises" or "comprising", will be understood to imply the inclusion of a stated
element, integer or step, or group of elements, integers or steps, but not the exclusion of
any other element, integer or step, or group of elements, integers or steps. Any
discussion of documents, acts, materials, devices, articles or the like which has been
included in the present specification is solely for the purpose of providing a context for
the present invention. It is not to be taken as an admission that any or all of these
matters form part of the prior art base or were common general knowledge in the field
relevant to the present invention as it existed before the priority date of each claim of
this application.

These and other objects, features and advantages of the present invention will 
become apparent after a review of the following detailed description of the disclosed 
embodiments and claims. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts schematically a transposon-based vector containing a
transposase operably-linked to a first promoter and a gene of interest operably-linked to
a second promoter, wherein the gene of interest and its operably-linked promoter are
flanked by insertion sequences (IS) recognized by the transposase. "Pro" designates a
promoter. In this and subsequent figures, the size of the actual nucleotide sequence is
not necessarily proportionate to the box representing that sequence.

Figure 2 depicts schematically a transposon-based vector for targeting
deposition of a polypeptide in an egg white wherein Ov pro is the ovalbumin promoter,
Ov protein is the ovalbumin protein and PolyA is a polyadenylation sequence. The
TAG sequence includes a spacer, the gp41 hairpin loop from HIV I and a protein
cleavage site.

Figure 3 depicts schematically a transposon-based vector for targeting 
deposition of a polypeptide in an egg white wherein Ovo pro is the ovomucoid 
promoter and Ovo SS is the ovomucoid signal sequence. The TAG sequence includes a 
spacer, the gp41 hairpin loop from HIV I and a protein cleavage site.

Figure 4 depicts schematically a transposon-based vector for targeting
deposition of a polypeptide in an egg yolk wherein Vit pro is the vitellogenin
promoter and Vit targ is the vitellogenin targeting sequence.

Figure 5 depicts schematically a transposon-based vector for expression of
antibody heavy and light chains. Prepro indicates a prepro sequence from cecropin
and pro indicates a pro sequence from cecropin.

Figure 6 depicts schematically a transposon-based vector for expression of
antibody heavy and light chains. Ent indicates an enterokinase cleavage sequence.

Figure 7 depicts schematically egg white targeted expression of antibody
heavy and light chains from one vector in either tail-to-tail (Figure 7A) or tail-to-head
(Figure 7B) configuration. In the tail-to-tail configuration, the ovalbumin signal
sequence adjacent to the gene for the light chain contains on its 3' end an enterokinase
cleavage site (not shown) to allow cleavage of the signal sequence from the light
chain, and the ovalbumin signal sequence adjacent to the gene for the heavy chain
contains on its 5' end an enterokinase cleavage site (not shown) to allow cleavage of
the signal sequence from the heavy chain. In the tail-to-head configuration, the
ovalbumin signal sequence adjacent to the gene for the heavy chain and the light
chain contains on its 3' end an enterokinase cleavage site (not shown) to allow
cleavage of the signal sequence from the heavy or light chain.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a new, effective and efficient method of
producing transgenic animals, particularly egg-laying animals and milk-producing
animals, through administration of a composition comprising a transposon-based
vector designed for stable incorporation of a gene of interest for production of a
desired molecule.

Definitions

It is to be understood that as used in the specification and in the claims, "a" or
"an" can mean one or more, depending upon the context in which it is used. Thus, for
example, reference to "a cell" can mean that at least one cell can be utilized.

The term "antibody" is used interchangeably with the term "immunoglobulin"
and is defined herein as a protein synthesized by an animal or a cell of the immune
system in response to the presence of a foreign substance commonly referred to as an
"antigen" or an "immunogen". The term antibody includes fragments of antibodies.
Antibodies are characterized by specific affinity to a site on the antigen, wherein the
site is referred to as an "antigenic determinant" or an "epitope". Antigens can be
naturally occurring or artificially engineered. Artificially engineered antigens include
but are not limited to small molecules, such as small peptides, attached to haptens
such as macromolecules, for example proteins, nucleic acids, or polysaccharides.
Artificially designed or engineered variants of naturally occurring antibodies and
artificially designed or engineered antibodies not occurring in nature are all included
in the current definition. Such variants include conservatively substituted amino acids
and other forms of substitution as described in the section concerning proteins and
polypeptides.

As used herein, the term "egg-laying animal" includes all amniotes such as
birds, turtles, lizards and monotremes. Monotremes are egg-laying mammals and
include the platypus and echidna. The term "bird" or "fowl," as used herein, is
defined as a member of the Aves class of animals which are characterized as warm-
blooded, egg-laying vertebrates primarily adapted for flying. Avians include, without
limitation, Ratites, Psittaciformes, Falconiformes, Piciformes, Strigiformes,
Passeriformes, Coraciformes, Ralliformes, Cuculiformes, Columbiformes,
Galliformes, Anseriformes, and Herodiones. The term "Ratite," as used herein, is
defined as a group of flightless, mostly large, running birds comprising several orders
and including the emus, ostriches, kiwis, and cassowaries. The term "Psittaciformes",
as used herein, includes parrots and refers to a monofamilial order of birds that exhibit
zygodactylism and have a strong hooked bill. A "parrot" is defined as any member of
the avian family Psittacidae (the single family of the Psittaciformes), distinguished by
the short, stout, strongly hooked beak. The term "chicken" as used herein denotes
chickens used for table egg production, such as egg-type chickens, chickens reared for
public meat consumption, or broilers, and chickens reared for both egg and meat
production ("dual-purpose" chickens). The term "chicken" also denotes chickens
produced by primary breeder companies, or chickens that are the parents,
grandparents, great-grandparents, etc. of those chickens reared for public table egg,
meat, or table egg and meat consumption.

The term "egg" is defined herein as a large female sex cell enclosed in a
porous, calcareous or leathery shell, produced by birds and reptiles. The term "ovum"
is defined as a female gamete, and is also known as an egg. Therefore, egg
production in all animals other than birds and reptiles, as used herein, is defined as the
production and discharge of an ovum from an ovary, or "ovulation". Accordingly, it
is to be understood that the term "egg" as used herein is defined as a large female sex
cell enclosed in a porous, calcareous or leathery shell, when a bird or reptile produces
it, or it is an ovum when it is produced by all other animals.

The term "milk-producing animal" refers herein to mammals including, but
not limited to, bovine, ovine, porcine, equine, and primate animals. Milk-producing
animals include but are not limited to cows, llamas, camels, goats, reindeer, zebu,
water buffalo, yak, horses, pigs, rabbits, non-human primates, and humans.

The term "gene" is defined herein to include a coding region for a protein,
peptide or polypeptide.

The term "vector" is used interchangeably with the terms "construct", "DNA
construct" and "genetic construct" to denote synthetic nucleotide sequences used for
manipulation of genetic material, including but not limited to cloning, subcloning,
sequencing, or introduction of exogenous genetic material into cells, tissues or
organisms, such as birds. It is understood by one skilled in the art that vectors may
contain synthetic DNA sequences, naturally occurring DNA sequences, or both. The
vectors of the present invention are transposon-based vectors as described herein.

When referring to two nucleotide sequences, one being a regulatory sequence,
the term "operably-linked" is defined herein to mean that the two sequences are
associated in a manner that allows the regulatory sequence to affect expression of the
other nucleotide sequence. It is not required that the operably-linked sequences be
directly adjacent to one another with no intervening sequence(s).

The term "regulatory sequence" is defined herein as including promoters,
enhancers and other expression control elements such as polyadenylation sequences,
matrix attachment sites, insulator regions for expression of multiple genes on a single
construct, ribosome entry/attachment sites, introns that are able to enhance
expression, and silencers.

Transposon-Based Vectors

While not wanting to be bound by the following statement, it is believed that
the nature of the DNA construct is an important factor in successfully producing
transgenic animals. The "standard" types of plasmid and viral vectors that have
previously been almost universally used for transgenic work in all species, especially
avians, have low efficiencies and may constitute a major reason for the low rates of
transformation previously observed. The DNA (or RNA) constructs previously used
often do not integrate into the host DNA, or integrate only at low frequencies. Other
factors may have also played a part, such as poor entry of the vector into target cells.
The present invention provides transposon-based vectors that can be administered to
an animal and that overcome the prior art problems relating to low transgene integration
frequencies. Two preferred transposon-based vectors of the present invention into
which a transposase, gene of interest and other polynucleotide sequences may be
introduced are termed pTnMCS (SEQ ID NO:36) and pTnMod (SEQ ID NO:1).

The transposon-based vectors of the present invention produce integration
frequencies an order of magnitude greater than has been achieved with previous
vectors. More specifically, intratesticular injections performed with a prior art
transposon-based vector (described in U.S. Patent No. 5,719,055) resulted in 41%
sperm positive roosters whereas intratesticular injections performed with the novel
transposon-based vectors of the present invention resulted in 77% sperm positive
roosters. Actual frequencies of integration were estimated by either or both
comparative strength of the PCR signal from the sperm and histological evaluation of
the testes and sperm by quantitative PCR.

The transposon-based vectors of the present invention include a transposase
gene operably-linked to a first promoter, and a coding sequence for a desired protein
or peptide operably-linked to a second promoter, wherein the coding sequence for the
desired protein or peptide and its operably-linked promoter are flanked by transposase
insertion sequences recognized by the transposase. The transposon-based vector also
includes one or more of the following characteristics: a) one or more modified Kozak
sequences comprising ACCATG (SEQ ID NO:13) at the 3' end of the first promoter
to enhance expression of the transposase; b) modifications of the codons for the first
several N-terminal amino acids of the transposase, wherein the third base of each
codon was changed to an A or a T without changing the corresponding amino acid; c)
addition of one or more stop codons to enhance the termination of transposase
synthesis; and, d) addition of an effective polyA sequence operably-linked to the
transposase to further enhance expression of the transposase gene. Figure 1 shows a
schematic representation of several components of the transposon-based vector. The
present invention further includes vectors containing more than one gene of interest,
wherein a second or subsequent gene of interest is operably-linked to the second
promoter or to a different promoter. It is also to be understood that the transposon-
based vectors shown in the Figures are representational of the present invention and
that the order of the vector elements may be different than that shown in the Figures,
that the elements may be present in various orientations, and that the vectors may
contain additional elements not shown in the Figures.
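
By way of illustration only, characteristic a) can be sketched in Python as follows. The
sketch assumes that the ATG of the ACCATG motif doubles as the start codon of the
transposase coding sequence, so that only the leading ACC needs to sit at the 3' end of
the promoter; that reading, the function names and the toy sequences are assumptions
made for this example and are not taken from the sequence listing.

```python
KOZAK = "ACCATG"  # modified Kozak sequence named in the text (SEQ ID NO:13)


def join_with_kozak(promoter: str, cds: str) -> str:
    """Join a promoter and a coding sequence so that the junction reads ACCATG.

    Assumption: the ATG of the motif is the start codon of the coding sequence.
    """
    promoter, cds = promoter.upper(), cds.upper()
    if not cds.startswith("ATG"):
        raise ValueError("coding sequence must begin with an ATG start codon")
    # Only the leading ACC has to be supplied at the 3' end of the promoter.
    if not promoter.endswith("ACC"):
        promoter += "ACC"
    return promoter + cds


def has_modified_kozak(construct: str, start_index: int) -> bool:
    """Return True if the ATG at start_index is immediately preceded by ACC."""
    construct = construct.upper()
    return start_index >= 3 and construct[start_index - 3:start_index + 3] == KOZAK
```

For example, join_with_kozak("TTGACA", "ATGAAA") yields a construct in which the start
codon at index 9 is preceded by ACC, and has_modified_kozak confirms the junction.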

In a further embodiment of the present invention, the transposase found in the
transposase-based vector is an altered target site (ATS) transposase and the insertion
sequences are those recognized by the ATS transposase. However, the transposase
located in the transposase-based vectors is not limited to a modified ATS transposase
and can be derived from any transposase. Transposases known in the prior art include
those found in AC7, Tn5SEQ1, Tn916, Tn951, Tn1721, Tn2410, Tn1681, Tn1, Tn2,
Tn3, Tn4, Tn5, Tn6, Tn9, Tn10, Tn30, Tn101, Tn903, Tn501, Tn1000 (γδ), Tn1681,
Tn2901, AC transposons, Mp transposons, Spm transposons, En transposons, Dotted
transposons, Mu transposons, Ds transposons, dSpm transposons and I transposons.
According to the present invention, these transposases and their regulatory sequences
are modified for improved functioning as follows: a) the addition of one or more
modified Kozak sequences comprising ACCATG (SEQ ID NO:13) at the 3' end of
the promoter operably-linked to the transposase; b) a change of the codons for the first
several amino acids of the transposase, wherein the third base of each codon was
changed to an A or a T without changing the corresponding amino acid; c) the
addition of one or more stop codons to enhance the termination of transposase
synthesis; and/or, d) the addition of an effective polyA sequence operably-linked to
the transposase to further enhance expression of the transposase gene.

Although not wanting to be bound by the following statement, it is believed
that the modifications of the first several N-terminal codons of the transposase gene
increase transcription of the transposase gene, in part, by increasing strand
dissociation. It is preferable that between approximately 1 and 20, more preferably
between 3 and 15, and most preferably between 4 and 12 of the first N-terminal codons
of the transposase are modified such that the third base of each codon is changed to an
A or a T without changing the encoded amino acid. In one embodiment, the first ten N-
terminal codons of the transposase gene are modified in this manner. It is also
preferred that the transposase contain mutations that make it less specific for preferred
insertion sites and thus increase the rate of transgene insertion as discussed in U.S.
Patent No. 5,719,055.
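
The codon modification described above can be sketched in Python as follows. This is a
minimal illustration under stated assumptions: the standard genetic code is used, the
start codon is counted as the first N-terminal codon, A is tried before T, and a codon
is left unchanged when neither substitution preserves the amino acid (for example ATG
and TGG). The function names and the default of ten codons are choices made for this
example only.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed; the text does not name a table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - len(cds) % 3, 3))


def modify_n_terminal_codons(cds: str, n_codons: int = 10) -> str:
    """Change the third base of the first n_codons codons to A or T when synonymous."""
    cds = cds.upper()
    out = []
    for index in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[index:index + 3]
        if index // 3 < n_codons and codon[2] not in "AT":
            for base in "AT":  # try A first, then T
                candidate = codon[:2] + base
                if CODON_TABLE[candidate] == CODON_TABLE[codon]:
                    codon = candidate
                    break
        out.append(codon)
    # Re-attach any trailing bases that do not form a complete codon.
    return "".join(out) + cds[len(out) * 3:]
```

Applied to the toy sequence ATGCTGGGCTGGAAG, the sketch returns ATGCTAGGATGGAAA: the
Leu, Gly and Lys codons now end in A, the Met and Trp codons are untouched, and the
encoded peptide is unchanged.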

In some embodiments, the transposon-based vectors are optimized for
expression in a particular host by changing the methylation patterns of the vector
DNA. For example, prokaryotic methylation may be reduced by using a methylation
deficient organism for production of the transposon-based vector. The transposon-
based vectors may also be methylated to resemble eukaryotic DNA for expression in a
eukaryotic host.

Transposases and insertion sequences from other analogous eukaryotic
transposon-based vectors that can also be modified and used are, for example, the
Drosophila P element derived vectors disclosed in U.S. Patent No. 6,291,243; the
Drosophila mariner element described in Sherman et al. (1998); or the sleeping beauty
transposon. See also Hackett et al. (1999); D. Lampe et al., 1999. Proc. Natl. Acad.
Sci. USA, 96:11428-11433; S. Fischer et al., 2001. Proc. Natl. Acad. Sci. USA,
98:6759-6764; L. Zagoraiou et al., 2001. Proc. Natl. Acad. Sci. USA, 98:11474-
11478; and D. Berg et al. (Eds.), Mobile DNA, Amer. Soc. Microbiol. (Washington,
1989). However, it should be noted that bacterial transposon-based elements
are preferred, as there is less likelihood that a eukaryotic transposase in the recipient
species will recognize prokaryotic insertion sequences bracketing the transgene.

Many transposases recognize different insertion sequences, and therefore, it is
to be understood that a transposase-based vector will contain insertion sequences
recognized by the particular transposase also found in the transposase-based vector.
In a preferred embodiment of the invention, the insertion sequences have been
shortened to about 70 base pairs in length as compared to those found in wild-type
transposons that typically contain insertion sequences of well over 100 base pairs.

While the examples provided below incorporate a "cut and insert" Tn10 based
vector that is destroyed following the insertion event, the present invention also
encompasses the use of a "rolling replication" type transposon-based vector. Use of a
rolling replication type transposon allows multiple copies of the transposon/transgene
to be made from a single transgene construct and the copies inserted. This type of
transposon-based system thereby provides for insertion of multiple copies of a
transgene into a single genome. A rolling replication type transposon-based vector
may be preferred when the promoter operably-linked to the gene of interest is endogenous
to the host cell and present in a high copy number or highly expressed. However, use
of a rolling replication system may require tight control to limit the insertion events to
non-lethal levels. Tn1, Tn2, Tn3, Tn4, Tn5, Tn9, Tn21, Tn501, Tn551, Tn951,
Tn1721, Tn2410 and Tn2603 are examples of a rolling replication type transposon,
although Tn5 could be both a rolling replication and a cut and insert type transposon.

Stop Codons and PolyA Sequences

In one embodiment, the transposon-based vector contains two stop codons
operably-linked to the transposase and/or to the gene of interest. In an alternate
embodiment, one stop codon of UAA or UGA is operably-linked to the transposase
and/or to the gene of interest. As used herein an "effective polyA sequence" refers to
either a synthetic or non-synthetic sequence that contains multiple and sequential
nucleotides containing an adenine base (an A polynucleotide string) and that increases
expression of the gene to which it is operably-linked. A polyA sequence may be
operably-linked to any gene in the transposon-based vector including, but not limited
to, a transposase gene and a gene of interest. In one embodiment, a polyA sequence
comprises the polynucleotide sequence provided in SEQ ID NO:28. A preferred
polyA sequence is optimized for use in the host animal or human. In one
embodiment, the polyA sequence is optimized for use in a bird, and more specifically,
a chicken. The chicken optimized polyA sequence generally contains a minimum of
60 base pairs, and more preferably between approximately 60 and several hundred
base pairs, that precede the A polynucleotide string and thereby separate the stop
codon from the A polynucleotide string. A chicken optimized polyA sequence may
also have a reduced amount of CT repeats as compared to a synthetic polyA sequence.
In one embodiment of the present invention, the polyA sequence comprises a
conalbumin polyA sequence as provided in SEQ ID NO:33 and as taken from
GenBank accession # Y00407, base pairs 10651-11058.
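
The spacing requirement described above can be checked with a short Python sketch. It is
offered only as an illustration and rests on several assumptions: the region passed in
begins at the stop codon, the A polynucleotide string is taken to be the first run of
at least ten consecutive A residues (the text gives no minimum run length), and the
function names are hypothetical.

```python
import re

# DNA forms of the stop codons; the text names UAA and UGA, and TAG is included
# here only as the remaining standard stop codon.
STOP_CODONS = ("TAA", "TGA", "TAG")


def spacer_length(three_prime_region: str, min_a_run: int = 10):
    """Count the bases between the stop codon and the first A string.

    Returns None when no run of at least min_a_run A residues is found.
    """
    seq = three_prime_region.upper()
    if seq[:3] not in STOP_CODONS:
        raise ValueError("region must start with a stop codon")
    match = re.search(r"A{%d,}" % min_a_run, seq[3:])
    return match.start() if match else None


def meets_chicken_minimum(three_prime_region: str, minimum_spacer: int = 60) -> bool:
    """Check the 60 base pair minimum stated for the chicken optimized polyA."""
    spacer = spacer_length(three_prime_region)
    return spacer is not None and spacer >= minimum_spacer
```

A region built from a stop codon, 60 non-A bases and a string of 20 A residues passes the
check, while the same construction with only 15 spacer bases does not.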
The first promoter operably-linked to the transposase gene and the second
promoter operably-linked to the gene of interest can be a constitutive promoter or an
inducible promoter. Constitutive promoters include, but are not limited to, immediate
early cytomegalovirus (CMV) promoter, herpes simplex virus 1 (HSV1) immediate
early promoter, SV40 promoter, lysozyme promoter, early and late CMV promoters,
early and late HSV promoters, β-actin promoter, tubulin promoter, Rous-Sarcoma
virus (RSV) promoter, and heat-shock protein (HSP) promoter. Inducible promoters
include tissue-specific promoters, developmentally-regulated promoters and
chemically inducible promoters. Examples of tissue-specific promoters include the
glucose 6 phosphate (G6P) promoter, vitellogenin promoter, ovalbumin promoter,
ovomucoid promoter, conalbumin promoter, ovotransferrin promoter, prolactin
promoter, kidney uromodulin promoter, and placental lactogen promoter. In one
embodiment, the vitellogenin promoter includes a polynucleotide sequence of SEQ ID
NO:17. The G6P promoter sequence may be deduced from a rat G6P gene
untranslated upstream region provided in GenBank accession number U57552.1.
Examples of developmentally-regulated promoters include the homeobox promoters
and several hormone induced promoters. Examples of chemically inducible
promoters include reproductive hormone induced promoters and antibiotic inducible
promoters such as the tetracycline inducible promoter and the zinc-inducible
promoter.


Other inducible promoter systems include the Lac operator repressor system
inducible by IPTG (isopropyl beta-D-thiogalactoside) (Cronin, A. et al. 2001, Genes
and Development, v. 15), ecdysone-based inducible systems (Hoppe, U. C. et al.
2000, Mol. Ther. 1:159-164); estrogen-based inducible systems (Braselmann, et al.
1993, Proc. Natl. Acad. Sci. 90:1657-1661); progesterone-based inducible systems
using a chimeric regulator, GLVP, which is a hybrid protein consisting of the GAL4
binding domain and the herpes simplex virus transcriptional activation domain, VP16,
and a truncated form of the human progesterone receptor that retains the ability to
bind ligand and can be turned on by RU486 (Wang, et al. 1994, Proc. Natl. Acad. Sci.
91:8180-8184); CID-based inducible systems using chemical inducers of dimerization
(CIDs) to regulate gene expression, such as a system wherein rapamycin induces
dimerization of the cellular proteins FKBP12 and FRAP (Belshaw, P. J. et al. 1996. J.
Chem. Biol. 3:731-738; Fan, L. et al. 1999. Hum. Gene Ther. 10:2273-2285; Shariat,
S.F. et al. 2001. Cancer Res. 61:2562-2571; Spencer, D.M. 1996. Curr. Biol. 6:839-
847). Chemical substances that activate the chemically inducible promoters can be
administered to the animal containing the transgene of interest via any method known
to those of skill in the art.

Other examples of cell or tissue-specific and constitutive promoters include
but are not limited to smooth-muscle SM22 promoter, including chimeric
SM22alpha/telokin promoters (Hoggatt A.M. et al., 2002. Circ Res. 91(12):1151-9);
ubiquitin C promoter (Biochim Biophys Acta, 2003, Jan. 3; 1625(1):52-63); Hsf2
promoter; murine COMP (cartilage oligomeric matrix protein) promoter; early B cell-
specific mb-1 promoter (Sigvardsson M., et al., 2002. Mol. Cell Biol. 22(24):8539-
51); prostate specific antigen (PSA) promoter (Yoshimura I. et al., 2002, J. Urol.
168(6):2659-64); exorh promoter and pineal expression-promoting element (Asaoka
Y., et al., 2002. Proc. Natl. Acad. Sci. 99(24):15456-61); neural and liver ceramidase
gene promoters (Okino N. et al., 2002. Biochem. Biophys. Res. Commun.
299(1):160-6); PSP94 gene promoter/enhancer (Gabril M.Y. et al., 2002. Gene Ther.
9(23):1589-99); promoter of the human FAT/CD36 gene (Kuriki C., et al., 2002. Biol.
Pharm. Bull. 25(11):1476-8); VL30 promoter (Staplin W.R. et al., 2002. Blood
October 24, 2002); IL-10 promoter (Brenner S., et al., 2002. J. Biol. Chem. December
18, 2002).

Examples of avian promoters include, but are not limited to, promoters
controlling expression of egg white proteins, such as ovalbumin, ovotransferrin
(conalbumin), ovomucoid, lysozyme, ovomucin, g2 ovoglobulin, g3 ovoglobulin,
ovoflavoprotein, ovostatin (ovomacroglobin), cystatin, avidin, thiamine-binding
protein, glutamyl aminopeptidase, minor glycoprotein 1, minor glycoprotein 2; and
promoters controlling expression of egg-yolk proteins, such as vitellogenin, very low-
density lipoproteins, low density lipoprotein, cobalamin-binding protein, riboflavin-
binding protein, biotin-binding protein (Awade, 1996. Z. Lebensm. Unters. Forsch.
202:1-14). An advantage of using the vitellogenin promoter is that it is active during
the egg-laying stage of an animal's life-cycle, which allows for the production of the
protein of interest to be temporally connected to the import of the protein of interest
into the egg yolk when the protein of interest is equipped with an appropriate
targeting sequence.

Liver-specific promoters of the present invention include, but are not limited
to, the following promoters: vitellogenin promoter, G6P promoter, cholesterol-7-
alpha-hydroxylase (CYP7A) promoter, phenylalanine hydroxylase (PAH) promoter,
protein C gene promoter, insulin-like growth factor I (IGF-I) promoter, bilirubin
UDP-glucuronosyltransferase promoter, aldolase B promoter, furin promoter,
metallothionein promoter, albumin promoter, and insulin promoter.

Also included in the present invention are promoters that can be used to target
expression of a protein of interest into the milk of a milk-producing animal including,
but not limited to, β-lactoglobulin promoter, whey acidic protein promoter, lactalbumin
promoter and casein promoter.

Promoters associated with cells of the immune system may also be used.
Acute phase promoters such as interleukin (IL)-1 and IL-2 may be employed.
Promoters for heavy and light chain Ig may also be employed. The promoters of the
T cell receptor components CD4 and CD8, B cell promoters and the promoters of
CR2 (complement receptor type 2) may also be employed. Immune system promoters
are preferably used when the desired protein is an antibody protein.

Also included in this invention are modified promoters/enhancers wherein
elements of a single promoter are duplicated, modified, or otherwise changed. In one
embodiment, steroid hormone-binding domains of the ovalbumin promoter are moved
from about -6.5 kb to within approximately the first 1000 base pairs of the gene of
interest. Modifying an existing promoter with promoter/enhancer elements not found
naturally in the promoter, as well as building an entirely synthetic promoter, or
drawing promoter/enhancer elements from various genes together on a non-natural
backbone, are all encompassed by the current invention.

Accordingly, it is to be understood that the promoters contained within the
transposon-based vectors of the present invention may be entire promoter sequences
or fragments of promoter sequences. For example, in one embodiment, the promoter
operably linked to a gene of interest is an approximately 900 base pair fragment of a
chicken ovalbumin promoter (SEQ ID NO:40). The constitutive and inducible
promoters contained within the transposon-based vectors may also be modified by the
addition of one or more modified Kozak sequences of ACCATG (SEQ ID NO:13).

As indicated above, the present invention includes transposon-based vectors
containing one or more enhancers. These enhancers may or may not be operably-
herein as an "enhanced promoter." The enhancers contained within the transposon-
based vectors are preferably enhancers found in birds, and more preferably, an
ovalbumin enhancer, but are not limited to these types of enhancers. In one
embodiment, an approximately 675 base pair enhancer element of an ovalbumin
promoter is cloned upstream of an ovalbumin promoter with 300 base pairs of spacer
DNA separating the enhancer and promoter. In one embodiment, the enhancer used
as a part of the present invention comprises base pairs 1-675 of a Chicken Ovalbumin
enhancer from GenBank accession #S82527.1. The polynucleotide sequence of this
enhancer is provided in SEQ ID NO:37.

Also included in some of the transposon-based vectors of the present invention
are cap sites and fragments of cap sites. In one embodiment, approximately 50 base
pairs of a 5' untranslated region wherein the cap site resides are added on the 3' end of
an enhanced promoter or promoter. An exemplary 5' untranslated region is provided
in SEQ ID NO:38. A putative cap-site residing in this 5' untranslated region
preferably comprises the polynucleotide sequence provided in SEQ ID NO:39.

In one embodiment of the present invention, the first promoter operably-linked
to the transposase gene is a constitutive promoter and the second promoter operably-
linked to the gene of interest is a tissue-specific promoter. In this embodiment, use of
the first constitutive promoter allows for constitutive activation of the transposase
gene and incorporation of the gene of interest into virtually all cell types, including
the germline of the recipient animal. Although the gene of interest is incorporated
into the germline generally, the gene of interest is only expressed in a tissue-specific
manner. It should be noted that cell- or tissue-specific expression as described herein
does not require a complete absence of expression in cells or tissues other than the
preferred cell or tissue. Instead, "cell-specific" or "tissue-specific" expression refers
to a majority of the expression of a particular gene of interest in the preferred cell or
tissue, respectively.
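
The definition of tissue-specific expression given above lends itself to a simple check,
sketched here in Python. The sketch assumes that "a majority" means more than half of
the total measured expression and that expression is available as one non-negative
value per tissue; the function name, the dictionary layout and the numbers are all
hypothetical.

```python
def is_tissue_specific(expression_by_tissue: dict, preferred_tissue: str) -> bool:
    """Return True when the preferred tissue carries a majority of total expression.

    Assumption: "majority" is read as strictly more than 50% of the summed values.
    """
    total = sum(expression_by_tissue.values())
    if total <= 0:
        return False  # nothing is expressed, so no tissue can hold a majority
    share = expression_by_tissue.get(preferred_tissue, 0.0) / total
    return share > 0.5
```

Under this reading, a gene measured at 80 units in the oviduct, 15 in the liver and 5 in
muscle counts as oviduct-specific even though expression elsewhere is not zero, whereas
a 40/35/25 split does not.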

When incorporation of the gene of interest into the germline is not preferred,
the first promoter operably-linked to the transposase gene can be a tissue-specific
promoter. For example, transfection of a transposon-based vector containing a
transposase gene operably-linked to a liver-specific promoter such as the G6P
promoter or vitellogenin promoter provides for activation of the transposase gene and
incorporation of the gene of interest in the cells of the liver but not into the germline
and other cells generally. In this second embodiment, the second promoter operably-
linked to the gene of interest can be a constitutive promoter or an inducible promoter.
In a preferred embodiment, both the first promoter and the second promoter are a G6P
promoter. In embodiments wherein tissue-specific expression or incorporation is
desired, it is preferred that the transposon-based vector is administered directly to the
tissue of interest or to an artery leading to the tissue of interest.

Accordingly, cell specific promoters may be used to enhance transcription in
selected tissues. In birds, for example, promoters that are found in cells of the
fallopian tube, such as ovalbumin, conalbumin, ovomucoid and/or lysozyme, are used
in the vectors to ensure transcription of the gene of interest in the epithelial cells and
tubular gland cells of the fallopian tube, leading to synthesis of the desired protein
encoded by the gene and deposition into the egg white. In mammals, promoters
specific for the epithelial cells of the alveoli of the mammary gland, such as prolactin,
insulin, beta-lactoglobulin, whey acidic protein, lactalbumin, casein, and/or placental
lactogen, are used in the design of vectors used for transfection of these cells for the
production of desired proteins for deposition into the milk. In liver cells, the G6P
promoter may be employed to drive transcription of the gene of interest for protein
production. Proteins made in the liver of birds may be delivered to the egg yolk.

In order to achieve higher or more efficient expression of the transposase
gene, the promoter and other regulatory sequences operably-linked to the transposase
gene may be those derived from the host. These host specific regulatory sequences
can be tissue specific as described above or can be of a constitutive nature. For
example, an avian actin promoter and its associated polyA sequence can be operably-
linked to a transposase in a transposase-based vector for transfection into an avian.
Examples of other host specific promoters that could be operably-linked to the
transposase include the myosin and DNA or RNA polymerase promoters.

Directing Sequences

In some embodiments of the present invention, the gene of interest is
operably-linked to a directing sequence or a sequence that provides proper
conformation to the desired protein encoded by the gene of interest. As used herein,
the term "directing sequence" refers to both signal sequences and targeting sequences.
An egg directing sequence includes, but is not limited to, an ovomucoid signal
sequence, an ovalbumin signal sequence and a vitellogenin targeting sequence. The
term "signal sequence" refers to an amino acid sequence, or the polynucleotide
sequence that encodes the amino acid sequence, that directs the protein to which it is
linked to the endoplasmic reticulum in a eukaryote, and more preferably the
translocational pores in the endoplasmic reticulum, or the plasma membrane in a
prokaryote, or mitochondria, such as for the purpose of gene therapy of mitochondrial
diseases. Signal and targeting sequences can be used to direct a desired protein into,
for example, the milk, when the transposon-based vectors are administered to a milk-
producing animal.

Signal sequences can also be used to direct a desired protein into, for example,
a secretory pathway for incorporation into the egg yolk or the egg white, when the
transposon-based vectors are administered to a bird or other egg-laying animal. One
example of such a transposon-based vector is provided in Figure 3 wherein the gene
of interest is operably linked to the ovomucoid signal sequence. The present
invention also includes a gene of interest operably-linked to a second gene containing
a signal sequence. An example of such an embodiment is shown in Figure 2 wherein
the gene of interest is operably-linked to the ovalbumin gene that contains an
ovalbumin signal sequence. Other signal sequences that can be included in the
transposon-based vectors include, but are not limited to the ovotransferrin and
lysozyme signal sequences.

As also used herein, the term "targeting sequence" refers to an amino acid
sequence, or the polynucleotide sequence encoding the amino acid sequence, which
amino acid sequence is recognized by a receptor located on the exterior of a cell.
Binding of the receptor to the targeting sequence results in uptake of the protein or
peptide operably-linked to the targeting sequence by the cell. One example of a
targeting sequence is a vitellogenin targeting sequence that is recognized by a
vitellogenin receptor (or the low density lipoprotein receptor) on the exterior of an
oocyte. In one embodiment, the vitellogenin targeting sequence includes the
polynucleotide sequence of SEQ ID NO:18. In another embodiment, the vitellogenin
targeting sequence includes all or part of the vitellogenin gene. Other targeting
sequences include VLDL and ApoE, which are also capable of binding the
vitellogenin receptor. Since the ApoE protein is not endogenously expressed in birds,
its presence may be used advantageously to identify birds carrying the transposon-
based vectors of the present invention.

A gene of interest selected for stable incorporation is designed to encode any
desired protein or peptide or to regulate any cellular response. In some embodiments,
the desired proteins or peptides are deposited in an egg or in milk. It is to be
understood that the present invention encompasses transposon-based vectors
containing multiple genes of interest. The multiple genes of interest may each be
operably-linked to a separate promoter and other regulatory sequence(s) or may all be
operably-linked to the same promoter and other regulatory sequence(s). In one
embodiment, multiple genes of interest are linked to a single promoter and other
regulatory sequence(s) and each gene of interest is separated by a cleavage site or a
pro portion of a signal sequence.

Protein and peptide hormones are a preferred class of proteins in the present
invention. Such protein and peptide hormones are synthesized throughout the
endocrine system and include, but are not limited to, hypothalamic hormones and
hypophysiotropic hormones, anterior, intermediate and posterior pituitary hormones,
pancreatic islet hormones, hormones made in the gastrointestinal system, renal
hormones, thymic hormones, parathyroid hormones, adrenal cortical and medullary
hormones. Specifically, hormones that can be produced using the present invention
include, but are not limited to, chorionic gonadotropin, corticotropin, erythropoietin,
glucagons, IGF-1, oxytocin, platelet-derived growth factor, calcitonin, follicle-
stimulating hormone, luteinizing hormone, thyroid-stimulating hormone, insulin,
gonadotropin-releasing hormone and its analogs, vasopressin, octreotide,
somatostatin, prolactin, adrenocorticotropic hormone, antidiuretic hormone,
thyrotropin-releasing hormone (TRH), growth hormone-releasing hormone (GHRH),
dopamine, melatonin, thyroxin (T4), parathyroid hormone (PTH), glucocorticoids
such as cortisol, mineralocorticoids such as aldosterone, androgens such as
testosterone, adrenaline (epinephrine), noradrenaline (norepinephrine), estrogens such
as estradiol, progesterone, glucagons, calcitriol, calciferol, atrial-natriuretic peptide,
gastrin, secretin, cholecystokinin (CCK), neuropeptide Y, ghrelin,
angiotensinogen, thrombopoietin, and leptin. By using appropriate polynucleotide
sequences, species-specific hormones may be made by transgenic animals.

In one embodiment of the present invention, the gene of interest is a proinsulin
gene and the desired molecule is insulin. Proinsulin consists of three parts: a C-
peptide and two long strands of amino acids (called the alpha and beta chains) that
later become linked together to form the insulin molecule. Figures 2 and 3 are
schematics of transposon-based vector constructs containing a proinsulin gene
operably-linked to an ovalbumin promoter and ovalbumin protein or an ovomucoid
promoter and ovomucoid signal sequence, respectively. In these embodiments,
proinsulin is expressed in the oviduct tubular gland cells and then deposited in the egg
white. One example of a proinsulin polynucleotide sequence is shown in SEQ ID
NO:21, wherein the C-peptide cleavage site spans from Arg at position 31 to Arg at
position 65.
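
The positions given above can be turned into a small Python sketch that separates a
proinsulin amino acid sequence into the segment preceding position 31, the excised
segment from position 31 to position 65, and the segment following position 65. The
sketch assumes 1-based amino acid positions and uses a synthetic placeholder string
rather than SEQ ID NO:21, which is not reproduced here; the function name and the
placeholder are hypothetical.

```python
def split_proinsulin(proinsulin: str, first: int = 31, last: int = 65):
    """Split a proinsulin sequence around the excised C-peptide region.

    first and last are 1-based, inclusive positions of the excised segment.
    Returns (segment before, excised segment, segment after).
    """
    if not 1 <= first <= last <= len(proinsulin):
        raise ValueError("cleavage positions fall outside the sequence")
    before = proinsulin[:first - 1]
    excised = proinsulin[first - 1:last]
    after = proinsulin[last:]
    return before, excised, after


# Synthetic placeholder, NOT a real sequence: 30 residues, Arg, 33 residues, Arg, 21 residues.
PLACEHOLDER = "B" * 30 + "R" + "X" * 33 + "R" + "A" * 21
```

Calling split_proinsulin(PLACEHOLDER) returns a 30-residue leading segment, a 35-residue
excised segment that begins and ends with R, and a 21-residue trailing segment.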

Serum proteins including lipoproteins such as high density lipoprotein (HDL),
HDL-Milano and low density lipoprotein, albumin, clotting cascade factors, factor
VIII, factor IX, fibrinogen, and globulins are also included in the group of desired
proteins of the present invention. Immunoglobulins are one class of desired globulin
molecules and include but are not limited to IgG, IgM, IgA, IgD, IgE, IgY, lambda
chains, kappa chains and fragments thereof, Fc fragments, and Fab fragments.
Desired antibodies include, but are not limited to, naturally occurring antibodies,
human antibodies, humanized antibodies, and hybrid antibodies. Genes encoding
modified versions of naturally occurring antibodies or fragments thereof and genes
encoding artificially designed antibodies or fragments thereof may be incorporated
into the transposon-based vectors of the present invention. Desired antibodies also
include antibodies with the ability to bind specific ligands, for example, antibodies
against proteins associated with cancer-related molecules, such as anti-her 2, or anti-
CA125. Accordingly, the present invention encompasses a transposon-based vector
containing one or more genes encoding a heavy immunoglobulin (Ig) chain and a
light Ig chain. Further, more than one gene encoding for more than one antibody may
be administered in one or more transposon-based vectors of the present invention. In
this manner, an egg may contain more than one type of antibody in the egg white, the
egg yolk or both.

In one embodiment, a transposon-based vector contains a heavy Ig chain and a
light Ig chain, both operably linked to a promoter. Figures 5 and 6 schematically
depict exemplary constructs of this embodiment. More specifically, Figure 5 shows a
construct containing a cecropin pre-pro sequence and a cecropin pro sequence,
wherein the pre sequence functions to direct the resultant protein into the endoplasmic
reticulum and the pro sequences are cleaved upon secretion of
the protein from a cell into which the construct has been transfected. Figure 6 shows
a construct containing an enterokinase cleavage site. In this embodiment, it may be
required to further remove several additional amino acids from the light chain
following cleavage by enterokinase. In another embodiment, the transposon-based
vector comprises a heavy Ig chain operably-linked to one promoter and a light Ig
chain operably-linked to another promoter. Figure 7 schematically depicts an
exemplary construct of this embodiment. The present invention also encompasses a
transposon-based vector containing genes encoding portions of a heavy Ig chain
and/or portions of a light Ig chain. The present invention further includes a
transposon-based vector containing a gene that encodes a fusion protein comprising a
heavy and/or light Ig chain, or portions thereof.

Antibodies used as therapeutic reagents include but are not limited to
antibodies for use in cancer immunotherapy against specific antigens, or for providing
passive immunity to an animal or a human against an infectious disease or a toxic
agent. Antibodies used as diagnostic reagents include, but are not limited to,
antibodies that may be labeled and detected with a detector, for example antibodies
with a fluorescent label attached that may be detected following exposure to specific
wavelengths. Such labeled antibodies may be primary antibodies directed to a
specific antigen, for example, rhodamine-labeled rabbit anti-growth hormone, or may
be labeled secondary antibodies, such as fluorescein-labeled goat-anti chicken IgG.
Such labeled antibodies are known to one of ordinary skill in the art. Labels useful
for attachment to antibodies are also known to one of ordinary skill in the art. Some of
these labels are described in the "Handbook of Fluorescent Probes and Research
Products", ninth edition, Richard P. Haugland (ed.), Molecular Probes, Inc., Eugene,
OR, which is incorporated herein in its entirety.

Antibodies produced using the present invention may be used as
laboratory reagents for numerous applications including radioimmunoassay, western
blots, dot blots, ELISA, immunoaffinity columns and other procedures requiring
antibodies as known to one of ordinary skill in the art. Such antibodies include
primary antibodies, secondary antibodies and tertiary antibodies, which may be
labeled or unlabeled.

Antibodies that may be made with the practice of the present invention
include, but are not limited to, primary antibodies, secondary antibodies, designer
antibodies, anti-protein antibodies, anti-peptide antibodies, anti-DNA antibodies, anti-
RNA antibodies, anti-hormone antibodies, anti-hypophysiotropic peptides, antibodies
against non-natural antigens, anti-anterior pituitary hormone antibodies, anti-posterior
pituitary hormone antibodies, anti-venom antibodies, anti-tumor marker antibodies,
antibodies directed against epitopes associated with infectious disease, including anti-
viral, anti-bacterial, anti-protozoal, anti-fungal, anti-parasitic, anti-receptor, anti-lipid,
anti-phospholipid, anti-growth factor, anti-cytokine, anti-monokine, anti-idiotype, and
anti-accessory (presentation) protein antibodies. Antibodies made with the present
invention, as well as light chains or heavy chains, may also be used to inhibit enzyme
activity.

Antibodies that may be produced using the present invention include, but are
not limited to, antibodies made against the following proteins: Bovine γ-Globulin,
Serum; Bovine IgG, Plasma; Chicken γ-Globulin, Serum; Human γ-Globulin, Serum;
Human IgA, Plasma; Human IgA1, Myeloma; Human IgA2, Myeloma; Human IgA2,
Plasma; Human IgD, Plasma; Human IgE, Myeloma; Human IgG, Plasma; Human
IgG, Fab Fragment, Plasma; Human IgG, F(ab')2 Fragment, Plasma; Human IgG, Fc
Fragment, Plasma; Human IgG1, Myeloma; Human IgG2, Myeloma; Human IgG3,
Myeloma; Human IgG4, Myeloma; Human IgM, Myeloma; Human IgM, Plasma;
Human Immunoglobulin, Light Chain κ, Urine; Human Immunoglobulin, Light
Chains κ and λ, Plasma; Mouse γ-Globulin, Serum; Mouse IgG, Serum; Mouse IgM,
Myeloma; Rabbit γ-Globulin, Serum; Rabbit IgG, Plasma; and Rat γ-Globulin,
Serum. In one embodiment, the transposon-based vector comprises the coding
sequence of light and heavy chains of a murine monoclonal antibody that shows
specificity for human seminoprotein (GenBank Accession numbers AY129006 and
AY129304 for the light and heavy chains, respectively).

A further non-limiting list of antibodies that recognize other antibodies is as
follows: Anti-Chicken IgG, heavy (H) & light (L) Chain Specific (Sheep); Anti-Goat
γ-Globulin (Donkey); Anti-Goat IgG, Fc Fragment Specific (Rabbit); Anti-Guinea Pig
γ-Globulin (Goat); Anti-Human Ig, Light Chain, Type κ Specific; Anti-Human Ig,
Light Chain, Type λ Specific; Anti-Human IgA, α-Chain Specific (Goat); Anti-
Human IgA, Fab Fragment Specific; Anti-Human IgA, Fc Fragment Specific; Anti-
Human IgA, Secretory; Anti-Human IgE, ε-Chain Specific (Goat); Anti-Human IgE,
Fc Fragment Specific; Anti-Human IgG, Fc Fragment Specific (Goat); Anti-Human
IgG, Fd Fragment Specific; Anti-Human IgG, H & L Chain Specific (Goat); Anti-
Human IgG1, Fc Fragment Specific; Anti-Human IgG2, Fc Fragment Specific; Anti-
Human IgG2, Fd Fragment Specific; Anti-Human IgG3, Hinge Specific; Anti-Human
IgG4, Fc Fragment Specific; Anti-Human IgM, Fc Fragment Specific; Anti-Human
IgM, μ-Chain Specific; Anti-Mouse IgE, ε-Chain Specific; Anti-Mouse γ-Globulin
(Goat); Anti-Mouse IgG, γ-Chain Specific (Goat); Anti-Mouse IgG, γ-Chain Specific
(Goat) F(ab')2 Fragment; Anti-Mouse IgG, H & L Chain Specific (Goat); Anti-Mouse
IgM, μ-Chain Specific (Goat); Anti-Mouse IgM, H & L Chain Specific (Goat); Anti-
Rabbit γ-Globulin (Goat); Anti-Rabbit IgG, Fc Fragment Specific (Goat); Anti-Rabbit
IgG, H & L Chain Specific (Goat); Anti-Rat γ-Globulin (Goat); Anti-Rat IgG, H & L
Chain Specific; Anti-Rhesus Monkey γ-Globulin (Goat); and Anti-Sheep IgG, H & L
Chain Specific.

Another non-limiting list of the antibodies that may be produced using the
present invention is provided in product catalogs of companies such as Phoenix
Pharmaceuticals, Inc. (www.phoenixpeptide.com; 530 Harbor Boulevard, Belmont,
CA), Peninsula Labs, San Carlos, CA, SIGMA, St. Louis, MO, www.sigma-aldrich.com,
Cappel ICN, Irvine, California, www.icnbiomed.com, and Calbiochem,
La Jolla, California, www.calbiochem.com, which are all incorporated herein by
reference in their entirety. The polynucleotide sequences encoding these antibodies
may be obtained from the scientific literature, from patents, and from databases such
as GenBank. Alternatively, one of ordinary skill in the art may design the
polynucleotide sequence to be incorporated into the genome by choosing the codons
that encode for each amino acid in the desired antibody. Antibodies made by the
transgenic animals of the present invention include antibodies that may be used as
therapeutic reagents, for example in cancer immunotherapy against specific antigens,
as diagnostic reagents and as laboratory reagents for numerous applications including
immunoneutralization, radioimmunoassay, western blots, dot blots, ELISA,
immunoprecipitation and immunoaffinity columns. Some of these antibodies include,
but are not limited to, antibodies which bind the following ligands: adrenomedullin,
amylin, calcitonin, amyloid, calcitonin gene-related peptide, cholecystokinin, gastrin,
gastric inhibitory peptide, gastrin releasing peptide, interleukin, interferon, cortistatin,
somatostatin, endothelin, sarafotoxin, glucagon, glucagon-like peptide, insulin, atrial
natriuretic peptide, BNP, CNP, neurokinin, substance P, leptin, neuropeptide Y,
melanin concentrating hormone, melanocyte stimulating hormone, orphanin,
endorphin, dynorphin, enkephalin, leumorphin, peptide F, PACAP,
PACAP-related peptide, parathyroid hormone, urocortin, corticotropin releasing
hormone, PHM, PHI, vasoactive intestinal polypeptide, secretin, ACTH, angiotensin,
angiostatin, bombesin, endostatin, bradykinin, FMRF amide, galanin, gonadotropin
releasing hormone (GnRH) associated peptide, GnRH, growth hormone releasing
hormone, inhibin, granulocyte-macrophage colony stimulating factor (GM-CSF),
motilin, neurotensin, oxytocin, vasopressin, osteocalcin, pancreastatin, pancreatic
polypeptide, peptide YY, proopiomelanocortin, transforming growth factor, vascular
endothelial growth factor, vesicular monoamine transporter, vesicular acetylcholine
transporter, ghrelin, NPW, NPB, C3d, prokineticin, thyroid stimulating hormone,
luteinizing hormone, follicle stimulating hormone, prolactin, growth hormone, beta-
lipotropin, melatonin, kallikreins, kinins, prostaglandins, erythropoietin, p146 (SEQ
ID NO:18, amino acid sequence; SEQ ID NO:19, nucleotide sequence), estrogen,
testosterone, corticosteroids, mineralocorticoids, thyroid hormone, thymic hormones,
connective tissue proteins, nuclear proteins, actin, avidin, activin, agrin, albumin, and
prohormones, propeptides, splice variants, fragments and analogs thereof.

The following is yet another non-limiting list of antibodies that can be produced
by the methods of the present invention: abciximab (ReoPro), abciximab anti-platelet
aggregation monoclonal antibody, anti-CD11a (hu 1124), anti-CD18 antibody, anti-
CD20 antibody, anti-cytomegalovirus (CMV) antibody, anti-digoxin antibody, anti-
hepatitis B antibody, anti-HER-2 antibody, anti-idiotype antibody to GD3 glycolipid,
anti-IgE antibody, anti-IL-2R antibody, antimetastatic cancer antibody (mAb 17-1A),
anti-rabies antibody, anti-respiratory syncytial virus (RSV) antibody, anti-Rh
antibody, anti-TCR, anti-TNF antibody, anti-VEGF antibody and Fab fragment
thereof, rattlesnake venom antibody, black widow spider venom antibody, coral snake
venom antibody, antibody against very late antigen-4 (VLA-4), C225 humanized
antibody to EGF receptor, chimeric (human & mouse) antibody against TNFα,
antibody directed against GPIIb/IIIa receptor on human platelets, gamma globulin,
anti-hepatitis B immunoglobulin, human anti-D immunoglobulin, human antibodies
against S. aureus, human tetanus immunoglobulin, humanized antibody against the
epidermal growth receptor-2, humanized antibody against the α subunit of the
interleukin-2 receptor, humanized antibody CTLA4IG, humanized antibody to the
IL-2 R α-chain, humanized anti-CD40-ligand monoclonal antibody (5c8), humanized
mAb against the epidermal growth receptor-2, humanized mAb to rous sarcoma virus,
humanized recombinant antibody (IgG1k) against respiratory syncytial virus (RSV),
lymphocyte immunoglobulin (anti-thymocyte antibody), lymphocyte
immunoglobulin, mAb against factor VII, MDX-210 bi-specific antibody against
HER-2, MDX-22, MDX-220 bi-specific antibody against TAG-72 on tumors, MDX-
33 antibody to FcγRI receptor, MDX-447 bi-specific antibody against EGF receptor,
MDX-447 bispecific humanized antibody to EGF receptor, MDX-RA immunotoxin
(ricin A linked) antibody, Medi-507 antibody (humanized form of BTI-322) against
CD2 receptor on T-cells, monoclonal antibody LDP-02, muromonab-CD3 (OKT3)
antibody, OKT3 ("muromonab-CD3") antibody, PRO 542 antibody, ReoPro
("abciximab") antibody, and TNF-IgG fusion protein.

The antibodies prepared using the methods of the present invention may also
be designed to possess specific labels that may be detected through means known to
one of ordinary skill in the art. The antibodies may also be designed to possess
specific sequences useful for purification through means known to one of ordinary
skill in the art. Specialty antibodies designed for binding specific antigens may also
be made in transgenic animals using the transposon-based vectors of the present
invention.

Production of a monoclonal antibody using the transposon-based vectors of
the present invention can be accomplished in a variety of ways. In one embodiment,
two vectors may be constructed: one that encodes the light chain, and a second vector
that encodes the heavy chain of the monoclonal antibody. These vectors may then be
incorporated into the genome of the target animal by methods disclosed herein. In an
alternative embodiment, the sequences encoding light and heavy chains of a
monoclonal antibody may be included on a single DNA construct. For example, the
coding sequence of light and heavy chains of a murine monoclonal antibody that
shows specificity for human seminoprotein can be expressed using transposon-based
constructs of the present invention (GenBank Accession numbers AY129006 and
AY129304 for the light and heavy chains, respectively).

Further included in the present invention are proteins and peptides synthesized
by the immune system including those synthesized by the thymus, lymph nodes,
spleen, and the gastrointestinal associated lymph tissues (GALT) system. The
immune system proteins and peptides that can be made in transgenic animals
using the transposon-based vectors of the present invention include, but are not
limited to, alpha-interferon, beta-interferon, gamma-interferon, alpha-interferon A,
alpha-interferon 1, G-CSF, GM-CSF, interleukin-1 (IL-1), IL-2, IL-3, IL-4, IL-5, IL-6,
IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, TNF-α, and TNF-β. Other cytokines
included in the present invention include cardiotrophin, stromal cell derived factor,
macrophage derived chemokine (MDC), melanoma growth stimulatory activity
(MGSA), macrophage inflammatory proteins 1 alpha (MIP-1 alpha), 2, 3 alpha, 3
beta, 4 and 5.

Lytic peptides such as p146 are also included in the desired molecules of the
present invention. In one embodiment, the p146 peptide comprises an amino acid
sequence of SEQ ID NO:19. The present invention also encompasses a transposon-
based vector comprising a p146 nucleic acid comprising a polynucleotide sequence of
SEQ ID NO:20.

Enzymes are another class of proteins that may be made through the use of the 
transposon-based vectors of the present invention. Such enzymes include but are not 
limited to adenosine deaminase, alpha-galactosidase, cellulase, collagenase, DNase I,
hyaluronidase, lactase, L-asparaginase, pancreatin, papain, streptokinase B, subtilisin,
superoxide dismutase, thrombin, trypsin, urokinase, fibrinolysin, glucocerebrosidase
and plasminogen activator. In some embodiments wherein the enzyme could have
deleterious effects, additional amino acids and a protease cleavage site are added to
the carboxy end of the enzyme of interest in order to prevent expression of a
functional enzyme. Subsequent digestion of the enzyme with a protease results in 
activation of the enzyme. 

Extracellular matrix proteins are one class of desired proteins that may be 
made through the use of the present invention. Examples include but are not limited 
to collagen, fibrin, elastin, laminin, and fibronectin and subtypes thereof. Intracellular
proteins and structural proteins are other classes of desired proteins in the present
invention.

Growth factors are another desired class of proteins that may be made through 
the use of the present invention and include, but are not limited to, transforming
growth factor-α (TGF-α), transforming growth factor-β (TGF-β), platelet-derived
growth factors (PDGF), fibroblast growth factors (FGF), including FGF acidic
isoforms 1 and 2, FGF basic form 2 and FGF 4, 8, 9 and 10, nerve growth factors
(NGF) including NGF 2.5s, NGF 7.0s and beta NGF and neurotrophins, brain derived
neurotrophic factor, cartilage derived factor, growth factors for stimulation of the 
production of red blood cells, growth factors for stimulation of the production of 
white blood cells, bone growth factors (BGF), basic fibroblast growth factor, vascular 
endothelial growth factor (VEGF), granulocyte colony stimulating factor (G-CSF), 
insulin like growth factor (IGF) I and II, hepatocyte growth factor, glial neurotrophic 
growth factor (GDNF), stem cell factor (SCF), keratinocyte growth factor (KGF), 
transforming growth factors (TGF), including TGFs alpha, beta, betal, beta2, beta3, 
skeletal growth factor, bone matrix derived growth factors, bone derived growth 
factors, erythropoietin (EPO) and mixtures thereof. 

Another desired class of proteins that may be made through the
use of the present invention includes but is not limited to leptin, leukemia inhibitory
factor (LIF), tumor necrosis factor alpha and beta, ENBREL, angiostatin, endostatin,
thrombospondin, osteogenic protein-1, bone morphogenetic proteins 2 and 7,
osteonectin, somatomedin-like peptide, and osteocalcin.

A non-limiting list of the peptides and proteins that may be made
through the use of the present invention is provided in product catalogs of
companies such as Phoenix Pharmaceuticals, Inc. (www.phoenixpeptide.com; 530
Harbor Boulevard, Belmont, CA), Peninsula Labs, San Carlos, CA, SIGMA, St. Louis,
MO, www.sigma-aldrich.com, Cappel ICN, Irvine, California, www.icnbiomed.com,
and Calbiochem, La Jolla, California, www.calbiochem.com. The polynucleotide
sequences encoding these proteins and peptides of interest may be obtained from the
scientific literature, from patents, and from databases such as GenBank.
Alternatively, one of ordinary skill in the art may design the polynucleotide sequence
to be incorporated into the genome by choosing the codons that encode for each
amino acid in the desired protein or peptide.

Some of these desired proteins or peptides that may be made through the use
of the present invention include but are not limited to the following: adrenomedullin,
amylin, calcitonin, amyloid, calcitonin gene-related peptide, cholecystokinin, gastrin,
gastric inhibitory peptide, gastrin releasing peptide, interleukin, interferon, cortistatin,
somatostatin, endothelin, sarafotoxin, glucagon, glucagon-like peptide, insulin, atrial
natriuretic peptide, BNP, CNP, neurokinin, substance P, leptin, neuropeptide Y,
melanin concentrating hormone, melanocyte stimulating hormone, orphanin,
endorphin, dynorphin, enkephalin, leumorphin, peptide F, PACAP, PACAP-related
peptide, parathyroid hormone, urocortin, corticotropin releasing hormone, PHM,
PHI, vasoactive intestinal polypeptide, secretin, ACTH, angiotensin, angiostatin,
bombesin, endostatin, bradykinin, FMRF amide, galanin, gonadotropin releasing
hormone (GnRH) associated peptide, GnRH, growth hormone releasing hormone,
inhibin, granulocyte-macrophage colony stimulating factor (GM-CSF), motilin,
neurotensin, oxytocin, vasopressin, osteocalcin, pancreastatin, pancreatic polypeptide,
peptide YY, proopiomelanocortin, transforming growth factor, vascular endothelial
growth factor, vesicular monoamine transporter, vesicular acetylcholine transporter,
ghrelin, NPW, NPB, C3d, prokineticin, thyroid stimulating hormone, luteinizing
hormone, follicle stimulating hormone, prolactin, growth hormone, beta-lipotropin,
melatonin, kallikreins, kinins, prostaglandins, erythropoietin, p146 (SEQ ID NO:19,
amino acid sequence; SEQ ID NO:20, nucleotide sequence), thymic hormones,
connective tissue proteins, nuclear proteins, actin, avidin, activin, agrin, albumin, and
prohormones, propeptides, splice variants, fragments and analogs thereof.
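
By way of illustration only, the design of a polynucleotide sequence by choosing a codon for each amino acid of a desired protein or peptide may be sketched as follows. The table `CODON_CHOICE` and the function `reverse_translate` are hypothetical names introduced for this sketch. The single codon listed for each amino acid is a valid codon of the standard genetic code, but the selection is an arbitrary assumption and is not a codon-usage table for any particular host animal.

```python
# Illustrative sketch: one arbitrarily chosen standard-code codon per amino acid.
# The choices are assumptions for demonstration, not a host-optimized table.
CODON_CHOICE = {
    "A": "GCC", "R": "CGC", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}
STOP_CODON = "TGA"


def reverse_translate(protein: str, add_stop: bool = True) -> str:
    """Return a DNA coding sequence encoding `protein` (one-letter codes)."""
    codons = []
    for position, residue in enumerate(protein.upper(), start=1):
        if residue not in CODON_CHOICE:
            raise ValueError(f"unknown amino acid {residue!r} at position {position}")
        codons.append(CODON_CHOICE[residue])
    if add_stop:
        codons.append(STOP_CODON)
    return "".join(codons)


if __name__ == "__main__":
    # A short hypothetical peptide, used only to exercise the function.
    print(reverse_translate("MKW"))
```

In practice, one of ordinary skill might replace the table with codons preferred in the target animal; the sketch shows only the one-codon-per-residue mapping itself.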

Other desired proteins that may be made by the transgenic animals of the
present invention include bacitracin, polymyxin b, vancomycin, cyclosporine, anti-
RSV antibody, alpha-1 antitrypsin (AAT), anti-cytomegalovirus antibody, anti-
hepatitis antibody, anti-inhibitor coagulant complex, anti-rabies antibody, anti-Rh(D)
antibody, adenosine deaminase, anti-digoxin antibody, antivenin crotalidae
(rattlesnake venom antibody), antivenin latrodectus (black widow spider venom
antibody), antivenin micrurus (coral snake venom antibody), aprotinin, corticotropin
(ACTH), diphtheria antitoxin, lymphocyte immune globulin (anti-thymocyte
antibody), protamine, thyrotropin, capreomycin, α-galactosidase, gramicidin,
streptokinase, tetanus toxoid, tyrothricin, IGF-1, proteins of varicella vaccine, anti-
TNF antibody, anti-IL-2r antibody, anti-HER-2 antibody, OKT3 ("muromonab-
CD3") antibody, TNF-IgG fusion protein, ReoPro ("abciximab") antibody, ACTH
fragment 1-24, desmopressin, gonadotropin-releasing hormone, histrelin, leuprolide,
lypressin, nafarelin, peptide that binds GPIIb/GPIIIa on platelets (integrilin),
goserelin, capreomycin, colistin, anti-respiratory syncytial virus, lymphocyte immune
globulin (Thymoglobulin, Atgam), panorex, alpha-antitrypsin, botulinin, lung surfactant
protein, tumor necrosis receptor-IgG fusion protein (enbrel), gonadorelin, proteins of
influenza vaccine, proteins of rotavirus vaccine, proteins of haemophilus b conjugate
vaccine, proteins of poliovirus vaccine, proteins of pneumococcal conjugate vaccine,
proteins of meningococcal C vaccine, proteins of influenza vaccine, megakaryocyte
growth and development factor (MGDF), neuroimmunophilin ligand-A (NIL-A),
brain-derived neurotrophic factor (BDNF), glial cell line-derived neurotrophic factor
(GDNF), leptin (native), leptin B, leptin C, IL-1RA (interleukin-1RA), R-568, novel
erythropoiesis-stimulating protein (NESP), humanized mAb to rous sarcoma virus
(MEDI-493), glutamyl-tryptophan dipeptide IM862, LFA-3TIP immunosuppressive,
humanized anti-CD40-ligand monoclonal antibody (5c8), gelsonin enzyme, tissue
factor pathway inhibitor (TFPI), proteins of meningitis B vaccine, antimetastatic
cancer antibody (mAb 17-1A), chimeric (human & mouse) mAb against TNFα, mAb
against factor VII, relaxin, capreomycin, glycopeptide (LY333328), recombinant
human activated protein C (rhAPC), humanized mAb against the epidermal growth
receptor-2, alteplase, anti-CD20 antigen, C2B8 antibody, insulin-like growth factor-1,
atrial natriuretic peptide (anaritide), tenecteplase, anti-CD11a antibody (hu 1124),
anti-CD18 antibody, mAb LDP-02, anti-VEGF antibody, Fab fragment of anti-VEGF
Ab, APO2 ligand (tumor necrosis factor-related apoptosis-inducing ligand), rTGF-β
(transforming growth factor-β), alpha-antitrypsin, ananain (a pineapple enzyme),
humanized mAb CTLA4IG, PRO 542 (mAb), D2E7 (mAb), calf intestine alkaline
phosphatase, α-L-iduronidase, α-L-galactosidase, human glutamic acid decarboxylase,
acid sphingomyelinase, bone morphogenetic protein-2 (rhBMP-2), proteins of HIV
vaccine, T cell receptor (TCR) peptide vaccine, TCR peptides, V beta 3 and V beta
13.1 (IR502), (IR501), BI 1050/1272 mAb against very late antigen-4 (VLA-4),
C225 humanized mAb to EGF receptor, anti-idiotype antibody to GD3 glycolipid,
antibacterial peptide against H. pylori, MDX-447 bispecific humanized mAb to EGF
receptor, anti-cytomegalovirus (CMV), Medi-491 B19 parvovirus vaccine, humanized
recombinant mAb (IgG1k) against respiratory syncytial virus (RSV), urinary tract
infection vaccine (against "pili" on Escherichia coli strains), proteins of lyme disease
vaccine against B. burgdorferi protein (DbpA), proteins of Medi-501 human
papilloma virus-11 vaccine (HPV), Streptococcus pneumoniae vaccine, Medi-507
mAb (humanized form of BTI-322) against CD2 receptor on T-cells, MDX-33 mAb
to FcγRI receptor, MDX-RA immunotoxin (ricin A linked) mAb, MDX-210 bi-
specific mAb against HER-2, MDX-447 bi-specific mAb against EGF receptor,
MDX-22, MDX-220 bi-specific mAb against TAG-72 on tumors, colony-stimulating
factor (CSF) (molgramostim), humanized mAb to the IL-2 R α-chain (basiliximab),
mAb to IgE (IGE 025A), myelin basic protein-altered peptide (MSP771A),
humanized mAb against the epidermal growth receptor-2, humanized mAb against the
α subunit of the interleukin-2 receptor, low molecular weight heparin, anti-
hemophilic factor, and bactericidal/permeability-increasing protein (r-BPI).

The peptides and proteins made using the present invention may be labeled
using labels and techniques known to one of ordinary skill in the art. Some of these
labels are described in the "Handbook of Fluorescent Probes and Research Products",
ninth edition, Richard P. Haugland (ed.), Molecular Probes, Inc., Eugene, OR, which is
incorporated herein in its entirety. Some of these labels may be genetically
engineered into the polynucleotide sequence for the expression of the selected protein
or peptide. The peptides and proteins may also have label-incorporation "handles"
incorporated to allow labeling of an otherwise difficult or impossible to label protein.

It is to be understood that the various classes of desired peptides and proteins,
as well as specific peptides and proteins described in this section may be modified as 
described below by inserting selected codons for desired amino acid substitutions into 
the gene incorporated into the transgenic animal. 

The present invention may also be used to produce desired molecules other
than proteins and peptides including, but not limited to, lipoproteins such as high
density lipoprotein (HDL), HDL-Milano, and low density lipoprotein, lipids,
carbohydrates, siRNA and ribozymes. In these embodiments, a gene of interest
encodes a nucleic acid molecule or a protein that directs production of the desired
molecule.

The present invention further encompasses the use of inhibitory molecules to
inhibit endogenous (i.e., non-vector) protein production. These inhibitory molecules
include antisense nucleic acids, siRNA and inhibitory proteins. In one embodiment, a
transposon-based vector containing an ovalbumin DNA sequence, that upon
transcription forms a double stranded RNA molecule, is transfected into an animal
such as a bird and the bird's production of endogenous ovalbumin protein is reduced
by the interference RNA mechanism (RNAi). Additionally, inducible knockouts or
knockdowns of the endogenous protein may be created to achieve a reduction or
inhibition of endogenous protein production.

Modified Desired Proteins and Peptides

"Proteins", "peptides," "polypeptides" and "oligopeptides" are chains of amino
acids (typically L-amino acids) whose alpha carbons are linked through peptide bonds
formed by a condensation reaction between the carboxyl group of the alpha carbon of
one amino acid and the amino group of the alpha carbon of another amino acid. The
terminal amino acid at one end of the chain (i.e., the amino terminal) has a free amino
group, while the terminal amino acid at the other end of the chain (i.e., the carboxy
terminal) has a free carboxyl group. As such, the term "amino terminus" (abbreviated
N-terminus) refers to the free alpha-amino group on the amino acid at the amino
terminal of the protein, or to the alpha-amino group (imino group when participating
in a peptide bond) of an amino acid at any other location within the protein.
Similarly, the term "carboxy terminus" (abbreviated C-terminus) refers to the free
carboxyl group on the amino acid at the carboxy terminus of a protein, or to the
carboxyl group of an amino acid at any other location within the protein.

Typically, the amino acids making up a protein are numbered in order, starting 
at the amino terminal and increasing in the direction toward the carboxy terminal of 
the protein. Thus, when one amino acid is said to "follow" another, that amino acid is 
positioned closer to the carboxy terminal of the protein than the preceding amino acid. 
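
The numbering convention described above may be illustrated with a minimal sketch. The function names `residue_at` and `follows` are hypothetical and are introduced only for this example; positions are counted from 1 at the amino terminal.

```python
def residue_at(sequence: str, position: int) -> str:
    """Return the residue at a 1-based position counted from the amino terminal."""
    if not 1 <= position <= len(sequence):
        raise IndexError(f"position {position} is outside 1..{len(sequence)}")
    return sequence[position - 1]


def follows(position_a: int, position_b: int) -> bool:
    """True when the residue at position_a lies closer to the carboxy terminal
    than the residue at position_b, i.e. it "follows" that residue."""
    return position_a > position_b


if __name__ == "__main__":
    peptide = "MKTAY"  # hypothetical five-residue peptide, N-terminal first
    print(residue_at(peptide, 1), residue_at(peptide, 5), follows(4, 2))
```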

The term "residue" is used herein to refer to an amino acid (D or L) or an
amino acid mimetic that is incorporated into a protein by an amide bond. As such, the
amino acid may be a naturally occurring amino acid or, unless otherwise limited, may
encompass known analogs of natural amino acids that function in a manner similar to
the naturally occurring amino acids (i.e., amino acid mimetics). Moreover, an amide
bond mimetic includes peptide backbone modifications well known to those skilled in
the art.

Furthermore, one of skill will recognize that, as mentioned above, individual
substitutions, deletions or additions which alter, add or delete a single amino acid or a
small percentage of amino acids (typically less than about 5%, more typically less
than about 1%) in an encoded sequence are conservatively modified variations where
the alterations result in the substitution of an amino acid with a chemically similar
amino acid. Conservative substitution tables providing functionally similar amino
acids are well known in the art. The following six groups each contain amino acids
that are conservative substitutions for one another:

1) Alanine (A), Serine (S), Threonine (T);

2) Aspartic acid (D), Glutamic acid (E);

3) Asparagine (N), Glutamine (Q);

4) Arginine (R), Lysine (K);

5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).
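
As a non-limiting sketch, the six groups listed above can be used to check whether a substitution is conservative and to estimate what fraction of an encoded sequence has been altered. The names `CONSERVATIVE_GROUPS`, `is_conservative` and `substitution_summary` are hypothetical, and the sketch assumes two sequences of equal length that differ only by substitutions (no insertions or deletions).

```python
# The six conservative-substitution groups, written as one-letter codes.
CONSERVATIVE_GROUPS = [
    set("AST"),   # 1) Alanine, Serine, Threonine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
]


def is_conservative(original: str, substitute: str) -> bool:
    """True when two different residues belong to the same group."""
    if original == substitute:
        return False  # an unchanged residue is not a substitution
    return any(original in g and substitute in g for g in CONSERVATIVE_GROUPS)


def substitution_summary(reference: str, variant: str):
    """Return (changed, conservative, fraction_changed) for equal-length sequences."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length")
    if not reference:
        raise ValueError("sequences must not be empty")
    pairs = [(r, v) for r, v in zip(reference, variant) if r != v]
    conservative = sum(is_conservative(r, v) for r, v in pairs)
    return len(pairs), conservative, len(pairs) / len(reference)


if __name__ == "__main__":
    # Hypothetical sequences: K->R and I->L are both within-group changes.
    print(substitution_summary("MKTAYI", "MRTAYL"))
```

The returned fraction can be compared against the thresholds recited above (about 5%, or about 1%) when deciding whether a variant falls within a small percentage of altered amino acids.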

A conservative substitution is a substitution in which the substituting amino
acid (naturally occurring or modified) is structurally related to the amino acid being
substituted, i.e., has about the same size and electronic properties as the amino acid
being substituted. Thus, the substituting amino acid would have the same or a similar
functional group in the side chain as the original amino acid. A "conservative
substitution" also refers to utilizing a substituting amino acid which is identical to the
amino acid being substituted except that a functional group in the side chain is
protected with a suitable protecting group.

Suitable protecting groups are described in Green and Wuts, "Protecting
Groups in Organic Synthesis", John Wiley and Sons, Chapters 5 and 7, 1991, the
teachings of which are incorporated herein by reference. Preferred protecting groups
are those which facilitate transport of the peptide through membranes, for example, by
reducing the hydrophilicity and increasing the lipophilicity of the peptide, and which
can be cleaved, either by hydrolysis or enzymatically (Ditter et al., 1968. J. Pharm.
Sci. 57:783; Ditter et al., 1968. J. Pharm. Sci. 57:828; Ditter et al., 1969. J. Pharm.
Sci. 58:557; King et al., 1987. Biochemistry 26:2294; Lindberg et al., 1989. Drug
Metabolism and Disposition 17:311; Tunek et al., 1988. Biochem. Pharm. 37:3867;
Anderson et al., 1985. Arch. Biochem. Biophys. 239:538; and Singhal et al., 1987.
FASEB J. 1:220). Suitable hydroxyl protecting groups include ester, carbonate and
carbamate protecting groups. Suitable amine protecting groups include acyl groups
and alkoxy or aryloxy carbonyl groups, as described above for N-terminal protecting
groups. Suitable carboxylic acid protecting groups include aliphatic, benzyl and aryl
esters, as described below for C-terminal protecting groups. In one embodiment, the
carboxylic acid group in the side chain of one or more glutamic acid or aspartic acid
residues in a peptide of the present invention is protected, preferably as a methyl,
ethyl, benzyl or substituted benzyl ester, more preferably as a benzyl ester.

Provided below are groups of naturally occurring and modified amino acids in 
which each amino acid in a group has similar electronic and steric properties. Thus, a 
conservative substitution can be made by substituting an amino acid with another 
amino acid from the same group. It is to be understood that these groups are
non-limiting, i.e., that there are additional modified amino acids which could be
included in each group.

Group I includes leucine, isoleucine, valine, methionine and modified amino acids
having the following side chains: ethyl, n-propyl, n-butyl. Preferably, Group I
includes leucine, isoleucine, valine and methionine.

Group II includes glycine, alanine, valine and a modified amino acid having an ethyl
side chain. Preferably, Group II includes glycine and alanine.

Group III includes phenylalanine, phenylglycine, tyrosine, tryptophan,
cyclohexylmethyl glycine, and modified amino residues having substituted
benzyl or phenyl side chains. Preferred substituents include one or more of
the following: halogen, methyl, ethyl, nitro, -NH2, methoxy, ethoxy and
-CN. Preferably, Group III includes phenylalanine, tyrosine and tryptophan.

Group IV includes glutamic acid, aspartic acid, a substituted or unsubstituted
aliphatic, aromatic or benzylic ester of glutamic or aspartic acid (e.g., methyl,
ethyl, n-propyl, iso-propyl, cyclohexyl, benzyl or substituted benzyl),
glutamine, asparagine, -CO-NH- alkylated glutamine or asparagine (e.g.,
methyl, ethyl, n-propyl and iso-propyl) and modified amino acids having the
side chain -(CH2)3-COOH, an ester thereof (substituted or unsubstituted
aliphatic, aromatic or benzylic ester), an amide thereof and a substituted or
unsubstituted N-alkylated amide thereof. Preferably, Group IV includes
glutamic acid, aspartic acid, methyl aspartate, ethyl aspartate, benzyl aspartate
and methyl glutamate, ethyl glutamate and benzyl glutamate, glutamine and
asparagine.

Group V includes histidine, lysine, ornithine, arginine, N-nitroarginine,
β-cycloarginine, γ-hydroxyarginine, N-amidinocitruline and
2-amino-4-guanidinobutanoic acid, homologs of lysine, homologs of arginine and
homologs of ornithine. Preferably, Group V includes histidine, lysine,
arginine and ornithine. A homolog of an amino acid includes from 1 to about
3 additional or subtracted methylene units in the side chain.

Group VI includes serine, threonine, cysteine and modified amino acids having
C1-C5 straight or branched alkyl side chains substituted with -OH or -SH, for
example, -CH2CH2OH, -CH2CH2CH2OH or -CH2CH2OHCH3. Preferably,
Group VI includes serine, cysteine or threonine.

In another aspect, suitable substitutions for amino acid residues include
"severe" substitutions. A "severe substitution" is a substitution in which the
substituting amino acid (naturally occurring or modified) has significantly different
size and/or electronic properties compared with the amino acid being substituted.
Thus, the side chain of the substituting amino acid can be significantly larger (or
smaller) than the side chain of the amino acid being substituted and/or can have
functional groups with significantly different electronic properties than the amino acid
being substituted. Examples of severe substitutions of this type include the
substitution of phenylalanine or cyclohexylmethyl glycine for alanine, isoleucine for
glycine, a D amino acid for the corresponding L amino acid, or
-NH-CH[(-CH2)5-COOH]-CO- for aspartic acid. Alternatively, a functional group may be
added to the side chain, deleted from the side chain or exchanged with another
functional group. Examples of severe substitutions of this type include adding a
functional group to the side chain of valine, leucine or isoleucine, exchanging the
carboxylic acid in the side chain of aspartic acid or glutamic acid with an amine, or
deleting the amine group in the side chain of lysine or ornithine. In yet another
alternative, the side chain of the substituting amino acid can have significantly
different steric and electronic properties than the functional group of the amino acid
being substituted. Examples of such modifications include tryptophan for glycine,
lysine for aspartic acid and -(CH2)4COOH for the side chain of serine. These
examples are not meant to be limiting.

In another embodiment, for example in the synthesis of a peptide 26 amino
acids in length, the individual amino acids may be substituted in the
following manner:

AA1 is serine, glycine, alanine, cysteine or threonine;
AA2 is alanine, threonine, glycine, cysteine or serine;
AA3 is valine, arginine, leucine, isoleucine, methionine, ornithine, lysine,
N-nitroarginine, β-cycloarginine, γ-hydroxyarginine, N-amidinocitruline or
2-amino-4-guanidinobutanoic acid;
AA4 is proline, leucine, valine, isoleucine or methionine;
AA5 is tryptophan, alanine, phenylalanine, tyrosine or glycine;
AA6 is serine, glycine, alanine, cysteine or threonine;
AA7 is proline, leucine, valine, isoleucine or methionine;
AA8 is alanine, threonine, glycine, cysteine or serine;
AA9 is alanine, threonine, glycine, cysteine or serine;
AA10 is leucine, isoleucine, methionine or valine;
AA11 is serine, glycine, alanine, cysteine or threonine;
AA12 is leucine, isoleucine, methionine or valine;
AA13 is leucine, isoleucine, methionine or valine;
AA14 is glutamine, glutamic acid, aspartic acid, asparagine, or a substituted or
unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
AA15 is arginine, N-nitroarginine, β-cycloarginine, γ-hydroxyarginine,
N-amidinocitruline or 2-amino-4-guanidinobutanoic acid;
AA16 is proline, leucine, valine, isoleucine or methionine;
AA17 is serine, glycine, alanine, cysteine or threonine;
AA18 is glutamic acid, aspartic acid, asparagine, glutamine or a substituted or
unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
AA19 is aspartic acid, asparagine, glutamic acid, glutamine, leucine, valine, isoleucine,
methionine or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or
aspartic acid;
AA20 is valine, arginine, leucine, isoleucine, methionine, ornithine, lysine,
N-nitroarginine, β-cycloarginine, γ-hydroxyarginine, N-amidinocitruline or
2-amino-4-guanidinobutanoic acid;
AA21 is alanine, threonine, glycine, cysteine or serine;
AA22 is alanine, threonine, glycine, cysteine or serine;
AA23 is histidine, serine, threonine, cysteine, lysine or ornithine;
AA24 is threonine, aspartic acid, serine, glutamic acid or a substituted or unsubstituted
aliphatic or aryl ester of glutamic acid or aspartic acid;
AA25 is asparagine, aspartic acid, glutamic acid, glutamine, leucine, valine,
isoleucine, methionine or a substituted or unsubstituted aliphatic or aryl ester of
glutamic acid or aspartic acid; and
AA26 is cysteine, histidine, serine, threonine, lysine or ornithine.

It is to be understood that these amino acid substitutions may be made for
peptides longer or shorter than the 26-mer in the preceding example, and for
proteins.

In one embodiment of the present invention, codons for the first several
N-terminal amino acids of the transposase are modified such that the third base of each
codon is changed to an A or a T without changing the corresponding amino acid. It is
preferable that between approximately 1 and 20, more preferably 3 and 15, and most
preferably between 4 and 12 of the first N-terminal codons of the gene of interest are
modified such that the third base of each codon is changed to an A or a T without
changing the corresponding amino acid. In one embodiment, the first ten N-terminal
codons of the gene of interest are modified in this manner.
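The third-base modification described above can be illustrated with a short sketch. This is not the procedure used by the inventors; it assumes the standard genetic code, tries A before T (an arbitrary choice), and leaves a codon untouched when no synonymous codon ends in A or T (for example ATG or TGG). The function names and the example sequence are hypothetical.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons of a DNA string into one-letter residues."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def favor_at_third_base(dna: str, n_codons: int = 10) -> str:
    """Change the third base of the first n_codons codons to A or T
    wherever a synonymous codon exists; other codons are left as they are."""
    dna = dna.upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    for k, codon in enumerate(codons[:n_codons]):
        if len(codon) < 3 or codon[2] in "AT":
            continue  # incomplete codon, or already ends in A/T
        for base in "AT":
            candidate = codon[:2] + base
            if CODON_TABLE[candidate] == CODON_TABLE[codon]:
                codons[k] = candidate
                break
    return "".join(codons)
```

With the made-up input `ATGGCCCTGTGGAAGGAC`, the sketch rewrites GCC, CTG, AAG and GAC while leaving ATG and TGG alone, and the encoded peptide is unchanged.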

When several desired proteins, protein fragments or peptides are encoded in
the gene of interest to be incorporated into the genome, one of skill in the art will
appreciate that the proteins, protein fragments or peptides may be separated by a
spacer molecule such as, for example, a peptide, consisting of one or more amino
acids. Generally, the spacer will have no specific biological activity other than to join
the desired proteins, protein fragments or peptides together, or to preserve some
minimum distance or other spatial relationship between them. However, the
constituent amino acids of the spacer may be selected to influence some property of
the molecule such as the folding, net charge, or hydrophobicity. The spacer may also
be contained within a nucleotide sequence with a purification handle or be flanked by
proteolytic cleavage sites.

Such polypeptide spacers may have from about 5 to about 40 amino acid
residues. The spacers in a polypeptide are independently chosen, but are preferably
all the same. The spacers should allow for flexibility of movement in space and are
therefore typically rich in small amino acids, for example, glycine, serine, proline or
alanine. Preferably, peptide spacers contain at least 60%, more preferably at least
80% glycine or alanine. In addition, peptide spacers generally have little or no
biological and antigenic activity. Preferred spacers are (Gly-Pro-Gly-Gly)x (SEQ ID
NO:5) and (Gly4-Ser)y, wherein x is an integer from about 3 to about 9 and y is an
integer from about 1 to about 8. Specific examples of suitable spacers include
(Gly-Pro-Gly-Gly)3

SEQ ID NO:6 Gly Pro Gly Gly Gly Pro Gly Gly Gly Pro Gly Gly
(Gly4-Ser)3

SEQ ID NO:7 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
or (Gly4-Ser)4

SEQ ID NO:8 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
Gly Gly Gly Gly Ser.
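As an illustration of how the repeat-based spacers above are composed, the following sketch builds a spacer from a repeat unit and reports its glycine/alanine content. It is only an aid to reading the sequences; the helper names `build_spacer` and `gly_ala_fraction` are hypothetical.

```python
# Repeat units named in the text, as three-letter residue codes.
GPGG = ("Gly", "Pro", "Gly", "Gly")
G4S = ("Gly", "Gly", "Gly", "Gly", "Ser")


def build_spacer(unit, repeats):
    """Return the residue list for `repeats` copies of `unit`."""
    return list(unit) * repeats


def gly_ala_fraction(residues):
    """Fraction of residues that are glycine or alanine."""
    return sum(r in ("Gly", "Ala") for r in residues) / len(residues)
```

For example, `build_spacer(GPGG, 3)` gives the twelve residues of the (Gly-Pro-Gly-Gly)3 spacer, and `build_spacer(G4S, 4)` gives the twenty residues of (Gly4-Ser)4; `gly_ala_fraction` can then be compared against the 60% and 80% preferences stated above.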

Nucleotide sequences encoding for the production of residues which may be
useful in purification of the expressed recombinant protein may also be built into the
vector. Such sequences are known in the art and include the glutathione binding
domain from glutathione S-transferase, polylysine, hexa-histidine or other cationic
amino acids, thioredoxin, hemagglutinin antigen and maltose binding protein.

Additionally, nucleotide sequences may be inserted into the gene of interest to
be incorporated so that the protein or peptide can also include from one to about six
amino acids that create signals for proteolytic cleavage. In this manner, if a gene is
designed to make one or more peptides or proteins of interest in the transgenic animal,
specific nucleotide sequences encoding for amino acids recognized by enzymes may
be incorporated into the gene to facilitate cleavage of the large protein or peptide
sequence into desired peptides or proteins or both. For example, nucleotides encoding
a proteolytic cleavage site can be introduced into the gene of interest so that a signal
sequence can be cleaved from a protein or peptide encoded by the gene of interest.
Nucleotide sequences encoding other amino acid sequences which display pH
sensitivity or chemical sensitivity may also be added to the vector to facilitate
separation of the signal sequence from the peptide or protein of interest.

In one embodiment of the present invention, a TAG sequence is linked to the
gene of interest. The TAG sequence serves three purposes: 1) it allows free rotation
of the peptide or protein to be isolated so there is no interference from the native
protein or signal sequence, i.e. vitellogenin, 2) it provides a "purification handle" to
isolate the protein using column purification, and 3) it includes a cleavage site to
remove the desired protein from the signal and purification sequences. Accordingly,
as used herein, a TAG sequence includes a spacer sequence, a purification handle and
a cleavage site. The spacer sequences in the TAG proteins contain one or more
repeats shown in SEQ ID NO:25. A preferred spacer sequence comprises the
sequence provided in SEQ ID NO:26. One example of a purification handle is the
gp41 hairpin loop from HIV I. Exemplary gp41 polynucleotide and polypeptide
sequences are provided in SEQ ID NO:24 and SEQ ID NO:23, respectively.
However, it should be understood that any antigenic region may be used as a
purification handle, including any antigenic region of gp41. Preferred purification
handles are those that elicit highly specific antibodies. Additionally, the cleavage site
can be any protein cleavage site known to one of ordinary skill in the art and includes
an enterokinase cleavage site comprising the Asp Asp Asp Asp Lys sequence (SEQ
ID NO:9) and a furin cleavage site. Constructs containing a TAG sequence are shown
in Figures 2 and 3. In one embodiment of the present invention, the TAG sequence
comprises a polynucleotide sequence of SEQ ID NO:22.
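To make the role of the cleavage site concrete, the sketch below locates an enterokinase recognition sequence (Asp Asp Asp Asp Lys, written DDDDK in one-letter code) in a fusion protein and splits the chain after the lysine, which is where enterokinase cuts. The function name and the example fusion sequence are hypothetical and are not taken from the sequence listing.

```python
ENTEROKINASE_SITE = "DDDDK"  # Asp Asp Asp Asp Lys (SEQ ID NO:9)


def cleave_after_site(fusion: str, site: str = ENTEROKINASE_SITE):
    """Split a one-letter protein sequence after the first occurrence of
    `site`. Returns (upstream_with_site, downstream); if the site is
    absent, the whole sequence is returned with an empty downstream part."""
    idx = fusion.find(site)
    if idx == -1:
        return fusion, ""
    cut = idx + len(site)
    return fusion[:cut], fusion[cut:]


# Made-up fusion: carrier/TAG residues, the cleavage site, then the
# desired protein beginning "FVNQ".
tag_part, desired = cleave_after_site("MKSPADDADDDDKFVNQ")
```

Because the cut falls immediately after the site, placing the site directly upstream of the desired protein releases that protein without residual TAG residues at its N-terminus.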

Methods of Administering Transposon-Based Vectors

In addition to the transposon-based vectors described above, the present
invention also includes methods of administering the transposon-based vectors to an
animal, methods of producing a transgenic animal wherein a gene of interest is
incorporated into the germline of the animal and methods of producing a transgenic
animal wherein a gene of interest is incorporated into cells other than the germline
cells of the animal. The transposon-based vectors of the present invention may be
administered to an animal via any method known to those of skill in the art, including,
but not limited to, intraembryonic, intratesticular, intraoviduct, intraperitoneal,
intraarterial, intravenous, topical, oral, nasal, and pronuclear injection methods of
administration, or any combination thereof. The transposon-based vectors may also
be administered within the lumen of an organ, into an organ, into a body cavity, into
the cerebrospinal fluid, through the urinary system or through any route to reach the
desired cells.

The transposon-based vectors may be delivered through the vascular system to
be distributed to the cells supplied by that vessel. For example, the compositions may
be placed in the artery supplying the ovary or supplying the fallopian tube to transfect
cells in those tissues. In this manner, follicles could be transfected to create a
germline transgenic animal. Alternatively, supplying the compositions through the
artery leading to the oviduct would preferably transfect the tubular gland and
epithelial cells. Such transfected cells could manufacture a desired protein or peptide
for deposition in the egg white. Administration of the compositions through the portal
vein would target uptake and transformation of hepatic cells. Administration through
the urethra and into the bladder would target the transitional epithelium of the bladder.
Administration through the vagina and cervix would target the lining of the uterus.
Administration through the internal mammary artery would transfect secretory cells of
the lactating mammary gland to perform a desired function, such as to synthesize and
secrete a desired protein or peptide into the milk.

In a preferred embodiment, the animal is an egg-laying animal, and more
preferably, an avian. In one embodiment, between approximately 1 and 50 µg,
preferably between 1 and 20 µg, and more preferably between 5 and 10 µg of
transposon-based vector DNA is administered to the oviduct of a bird. Optimal
ranges depend upon the type of bird and the bird's stage of sexual maturity.
Intraoviduct administration of the transposon-based vectors of the present invention
results in a PCR-positive signal in the oviduct tissue, whereas intravascular
administration results in a PCR-positive signal in the liver. In other embodiments, the
transposon-based vector is administered to an artery that supplies the oviduct or the
liver. These methods of administration may also be combined with any methods for
facilitating transfection, including without limitation, electroporation, gene guns,
injection of naked DNA, and use of dimethyl sulfoxide (DMSO).

The present invention includes a method of intraembryonic administration of a
transposon-based vector to an avian embryo comprising the following steps: 1)
incubating an egg on its side at room temperature for two hours to allow the embryo
contained therein to move to top dead center (TDC); 2) drilling a hole through the
shell without penetrating the underlying shell membrane; 3) injecting the embryo with
the transposon-based vector in solution; 4) sealing the hole in the egg; and 5) placing
the egg in an incubator for hatching. Administration of the transposon-based vector
can occur anytime between immediately after egg lay (when the embryo is at Stage X)
and hatching. Preferably, the transposon-based vector is administered between 1 and
7 days after egg lay, more preferably between 1 and 2 days after egg lay. The
transposon-based vectors may be introduced into the embryo in amounts ranging from
about 5.0 pg to 10 pg, preferably 1.0 µg to 100 pg. Additionally, the
transposon-based vector solution volume may be between approximately 1 µl and 75 µl
in quail and between approximately 1 µl and 500 µl in chicken.

The present invention also includes a method of intratesticular administration
of a transposon-based vector including injecting a bird with a composition comprising
the transposon-based vector, an appropriate carrier and an appropriate transfection
reagent. In one embodiment, the bird is injected before sexual maturity, preferably
between approximately 4-14 weeks, more preferably between approximately 6-14
weeks and most preferably between 8-12 weeks old. In another embodiment, a
mature bird is injected with a transposon-based vector, an appropriate carrier and an
appropriate transfection reagent. The mature bird may be any type of bird, but in one
example the mature bird is a quail.

A bird is preferably injected prior to the development of the blood-testis
barrier, which thereby facilitates entry of the transposon-based vector into the
seminiferous tubules and transfection of the spermatogonia or other germline cells.
At and between the ages of 4, 6, 8, 10, 12, and 14 weeks, it is believed that the testes
of chickens are likely to be most receptive to transfection. In this age range, the
blood/testis barrier has not yet formed, and there is a relatively high number of
spermatogonia relative to the numbers of other cell types, e.g., spermatids, etc. See J.
Kumaran et al., 1949. Poultry Sci., 29:511-520. See also E. Oakberg, 1956. Am. J.
Anatomy, 99:507-515; and P. Kluin et al., 1984. Anat. Embryol., 169:73-78.

The transposon-based vectors may be introduced into a testis in an amount
ranging from about 0.1 µg to 10 µg, preferably 1 to 10 µg, more preferably 3 µg to
10 µg. In a quail, about 5 µg is a preferred amount. In a chicken, about 5 to 10 µg
per testis is preferred. These amounts of vector DNA may be injected in one dose or
multiple doses and at one site or multiple sites in the testis. In a preferred
embodiment, the vector DNA is administered at multiple sites in a single testis, both
testes being injected in this manner. In one embodiment, injection is spread over
three injection sites: one at each end of the testis, and one in the middle. Additionally,
the transposon-based vector solution volume may be between approximately 1 µl and
75 µl in quail and between approximately 1 µl and 500 µl in chicken. In a preferred
embodiment, the transposon-based vector solution volume may be between
approximately 20 µl and 60 µl in quail and between approximately 50 µl and 250 µl in
chicken. Both the amount of vector DNA and the total volume injected into each
testis may be determined based upon the age and size of the bird.

According to the present invention, the transposon-based vector is
administered in conjunction with an acceptable carrier and/or transfection reagent.
Acceptable carriers include, but are not limited to, water, saline, Hanks Balanced Salt
Solution (HBSS), Tris-EDTA (TE) and lyotropic liquid crystals. Transfection
reagents commonly known to one of ordinary skill in the art that may be employed
include, but are not limited to, the following: cationic lipid transfection reagents,
cationic lipid mixtures, polyamine reagents, liposomes and combinations thereof;
SUPERFECT®, Cytofectene, BioPORTER®, GenePORTER®, NeuroPORTER®,
and perfectin from Gene Therapy Systems; lipofectamine, cellfectin, DMRIE-C,
oligofectamine, and PLUS reagent from Invitrogen; Xtreme gene, fugene, DOSPER
and DOTAP from Roche; Lipotaxi and Genejammer from Stratagene; and Escort
from SIGMA. In one embodiment, the transfection reagent is SUPERFECT®. The
ratio of DNA to transfection reagent may vary based upon the method of
administration. In one embodiment, the transposon-based vector is administered
intratesticularly and the ratio of DNA to transfection reagent can be from 1:1.5 to
1:15, preferably 1:2 to 1:10, all expressed as wt/vol. Transfection may also be
accomplished using other means known to one of ordinary skill in the art, including
without limitation electroporation, gene guns, injection of naked DNA, and use of
dimethyl sulfoxide (DMSO).
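The DNA-to-reagent ratio above lends itself to a small worked calculation. The sketch below assumes that "wt/vol" means micrograms of DNA per microliter of transfection reagent, which the text does not state explicitly; the function name is hypothetical.

```python
def reagent_volume_ul(dna_ug: float, ratio: float) -> float:
    """Volume of transfection reagent for a DNA:reagent ratio of 1:ratio.

    Assumes the wt/vol ratio is expressed as µg DNA : µl reagent, and
    restricts `ratio` to the 1:1.5 to 1:15 range given in the text."""
    if not 1.5 <= ratio <= 15:
        raise ValueError("ratio outside the 1:1.5 to 1:15 range")
    return dna_ug * ratio
```

Under that assumption, 5 µg of vector DNA at a 1:2 ratio would be combined with 10 µl of reagent, and the same amount at 1:10 with 50 µl.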

Depending upon the cell or tissue type targeted for transfection, the form of
the transposon-based vector may be important. Plasmids harvested from bacteria are
generally closed circular supercoiled molecules, and this is the preferred state of a
vector for gene delivery because of the ease of preparation. In some instances,
transposase expression and insertion may be more efficient in a relaxed, closed
circular configuration or in a linear configuration. In still other instances, a purified
transposase protein may be co-injected with a transposon-based vector containing the
gene of interest for more immediate insertion. This could be accomplished by using a
transfection reagent complexed with both the purified transposase protein and the
transposon-based vector.

WO 2004/003157 PCT/US2003/020389 




Following administration of a transposon-based vector to an animal, DNA is
extracted from the animal to confirm integration of the gene of interest. Actual
frequencies of integration are estimated both by comparative strength of the PCR
signal, and by histological evaluation of the tissues by quantitative PCR. Another
method for estimating the rate of transgene insertion is the so-called primed in situ
hybridization technique (PRINS). This method determines not only which cells carry
a transgene of interest, but also into which chromosome the gene has inserted, and
even what portion of the chromosome. Briefly, labeled primers are annealed to
chromosome spreads (affixed to glass slides) through one round of PCR, and the
slides are then developed through normal in situ hybridization procedures. This
technique combines the best features of in situ PCR and fluorescence in situ
hybridization (FISH) to provide distinct chromosome location and copy number of the
gene in question. The 28s rRNA gene will be used as a positive control for
spermatogonia to confirm that the technique is functioning properly. Using different
fluorescent labels for the transgene and the 28s gene causes cells containing a
transgene to fluoresce with two different colored tags.

Breeding experiments are also conducted to determine if germline
transmission of the transgene has occurred. In a general bird breeding experiment
performed according to the present invention, each male bird was exposed to 2-3
different adult female birds for 3-4 days each. This procedure was continued with
different females for a total period of 6-12 weeks. Eggs were collected daily for up to
14 days after the last exposure to the transgenic male, and each egg was incubated in a
standard incubator. In the first series of experiments the resulting embryos were
examined for transgene presence at day 3 or 4 using PCR.

Any male producing a transgenic embryo was bred to additional females.
Eggs from these females were incubated, hatched, and the chicks tested for the
exogenous DNA. Any embryos that died were necropsied and examined directly for
the transgene or protein encoded by the transgene, either by fluorescence or PCR.
The offspring that hatched and were found to be positive for the exogenous DNA
were raised to maturity. These birds were bred to produce further generations of
transgenic birds, to verify efficiency of the transgenic procedure and the stable
incorporation of the transgene into the germ line. The resulting embryos were
examined for transgene presence at day 3 or 4 using PCR.

It is to be understood that the above procedure can be modified to suit animals
other than birds and that selective breeding techniques may be performed to amplify
gene copy numbers and protein output.



In one embodiment, the transposon-based vectors of the present invention may
be administered to a bird for production of desired proteins or peptides in the egg
white. These transposon-based vectors preferably contain one or more of an
ovalbumin promoter, an ovomucoid promoter, an ovalbumin signal sequence and an
ovomucoid signal sequence. Oviduct-specific ovalbumin promoters are described in
B. O'Malley et al., 1987. EMBO J., vol. 6, pp. 2305-12; A. Qiu et al., 1994. Proc. Nat.
Acad. Sci. (USA), vol. 91, pp. 4451-4455; D. Monroe et al., 2000. Biochim. Biophys.
Acta, 1517(1):27-32; H. Park et al., 2000. Biochem., 39:8537-8545; and T.
Muramatsu et al., 1996. Poult. Avian Biol. Rev., 6:107-123. Examples of
transposon-based vectors designed for production of a desired protein in an egg white
are shown in Figures 2 and 3.

Production of Desired Proteins or Peptides in Egg Yolk

The present invention is particularly advantageous for production of
recombinant peptides and proteins of low solubility in the egg yolk. Such proteins
include, but are not limited to, membrane-associated or membrane-bound proteins,
lipophilic compounds, attachment factors, receptors, and components of second
messenger transduction machinery. Low solubility peptides and proteins are
particularly challenging to produce using conventional recombinant protein
production techniques (cell and tissue cultures) because they aggregate in
water-based, hydrophilic environments. Such aggregation necessitates denaturation and
re-folding of the recombinantly-produced proteins, which may deleteriously affect their
structure and function. Moreover, even highly soluble recombinant peptides and
proteins may precipitate and require denaturation and renaturation when produced in
sufficiently high amounts in recombinant protein production systems. The present
invention provides an advantageous resolution of the problem of protein and peptide
solubility during production of large amounts of recombinant proteins.

In one embodiment of the present invention, deposition of a desired protein
into the egg yolk is accomplished by attaching a sequence encoding a protein capable
of binding to the yolk vitellogenin receptor to a gene of interest that encodes a desired
protein. This transposon-based vector can be used for the receptor-mediated uptake
of the desired protein by the oocytes. In a preferred embodiment, the sequence
ensuring the binding to the vitellogenin receptor is a targeting sequence of a
vitellogenin protein. The invention encompasses various vitellogenin proteins and
their targeting sequences. In a preferred embodiment, a chicken vitellogenin protein
targeting sequence is used; however, due to the high degree of conservation among
vitellogenin protein sequences and known cross-species reactivity of vitellogenin
targeting sequences with their egg-yolk receptors, other vitellogenin targeting
sequences can be substituted. One example of a construct for use in the
transposon-based vectors of the present invention and for deposition of an insulin
protein in an egg yolk is provided in SEQ ID NO:27. In this embodiment, the
transposon-based vector contains a vitellogenin promoter, a vitellogenin targeting
sequence, a TAG sequence, a pro-insulin sequence and a synthetic polyA sequence.
The present invention includes, but is not limited to, vitellogenin targeting sequences
residing in the N-terminal domain of vitellogenin, particularly in lipovitellin I. In one
embodiment, the vitellogenin targeting sequence contains the polynucleotide
sequence of SEQ ID NO:18.

In a preferred embodiment, the transposon-based vector contains a transposase
gene operably-linked to a liver-specific promoter and a gene of interest
operably-linked to a liver-specific promoter and a vitellogenin targeting sequence. Figure 4
shows an example of such a construct. In another preferred embodiment, the
transposon-based vector contains a transposase gene operably-linked to a constitutive
promoter and a gene of interest operably-linked to a liver-specific promoter and a
vitellogenin targeting sequence.

For large-scale production of protein, an animal breeding stock that is
homozygous for the transgene is preferred. Such homozygous individuals are
obtained and identified through, for example, standard animal breeding procedures or
PCR protocols.

Once expressed, peptides, polypeptides and proteins can be purified according
to standard procedures known to one of ordinary skill in the art, including ammonium
sulfate precipitation, affinity columns, column chromatography, gel electrophoresis,
high performance liquid chromatography, immunoprecipitation and the like.
Substantially pure compositions of about 50 to 99% homogeneity are preferred, and
80 to 95% or greater homogeneity are most preferred for use as therapeutic agents.

In one embodiment of the present invention, the animal in which the desired
protein is produced is an egg-laying animal. In a preferred embodiment of the present
invention, the animal is an avian and a desired peptide, polypeptide or protein is
isolated from an egg white. Egg white containing the exogenous protein or peptide is 



on an 

variety of methods known in the egg industry. See, e.g., W. Stadelman et al. (Eds.),
Egg Science & Technology, Haworth Press, Binghamton, NY (1993). Isolation of the
exogenous peptide or protein from the other egg white constituents is accomplished
by any of a number of polypeptide isolation and purification methods well known to
one of ordinary skill in the art. These techniques include, for example,
chromatographic methods such as gel permeation, ion exchange, affinity separation,
metal chelation, HPLC, and the like, either alone or in combination. Another means
that may be used for isolation or purification, either in lieu of or in addition to
chromatographic separation methods, includes electrophoresis. Successful isolation
and purification is confirmed by standard analytic techniques, including HPLC, mass
spectroscopy, and spectrophotometry. These separation methods are often facilitated
if the first step in the separation is the removal of the endogenous ovalbumin fraction
of egg white, as doing so will reduce the total protein content to be further purified by



To facilitate or enable purification of a desired protein or peptide, transposon-based
vectors may include one or more additional epitopes or domains. Such epitopes
or domains include DNA sequences encoding enzymatic or chemical cleavage sites
including, but not limited to, an enterokinase cleavage site; the glutathione binding
domain from glutathione S-transferase; polylysine; hexa-histidine or other cationic
amino acids; thioredoxin; hemagglutinin antigen; maltose binding protein; a fragment
of gp41 from HIV; and other purification epitopes or domains commonly known to
one of skill in the art.

In one representative embodiment, purification of desired proteins from egg
white utilizes the antigenicity of the ovalbumin carrier protein and particular attributes
of a TAG linker sequence that spans ovalbumin and the desired protein. The TAG
sequence is particularly useful in this process because it contains 1) a highly antigenic
epitope, a fragment of gp41 from HIV, allowing for stringent affinity purification,
and 2) a recognition site for the protease enterokinase immediately juxtaposed to the
desired protein. In a preferred embodiment, the TAG sequence comprises
approximately 50 amino acids. A representative TAG sequence is provided below.

Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp
Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp S&B^^
Gly Trp Ile Gly Leu Leu Asp Asp Asp Asp Lys (SEQ ID NO:22)

The underlined sequences were taken from the hairpin loop domain of HIV gp41
(SEQ ID NO:23). Sequences in italics represent the cleavage site for enterokinase
(SEQ ID NO:9). The spacer sequence upstream of the loop domain was made from
repeats of (Pro Ala Asp Asp Ala) (SEQ ID NO:25) to provide free rotation and
promote surface availability of the hairpin loop from the ovalbumin carrier protein.
Isolation and purification of a desired protein is performed as follows:
1. Enrichment of the egg white protein fraction containing ovalbumin and the
   transgenic ovalbumin-TAG-desired protein.

2. Size exclusion chromatography to isolate only those proteins within a narrow
   range of molecular weights (a further enrichment of step 1).

3. Ovalbumin affinity chromatography. Highly specific antibodies to ovalbumin
   will eliminate virtually all extraneous egg white proteins except ovalbumin
   and the transgenic ovalbumin-TAG-desired protein.

4. gp41 affinity chromatography using anti-gp41 antibodies. Stringent
   application of this step will result in virtually pure transgenic
   ovalbumin-TAG-desired protein.

5. Cleavage of the transgenic product can be accomplished in at least one of two
   ways:

   a. The transgenic ovalbumin-TAG-desired protein is left attached to the
      gp41 affinity resin (beads) from step 4 and the protease enterokinase is
      added. This liberates the transgene target protein from the gp41 affinity
      resin while the ovalbumin-TAG sequence is retained. Separation by
      centrifugation (in a batch process) or flow through (in a column
      purification) leaves the desired protein together with enterokinase in
      solution. Enterokinase is recovered and reused.

   b. Alternatively, enterokinase is immobilized on resin (beads) by the
      addition of poly-lysine moieties to a non-catalytic area of the protease.
      The transgenic ovalbumin-TAG-desired protein eluted from the
      affinity column of step 4 is then applied to the protease resin. Protease
      action cleaves the ovalbumin-TAG sequence from the desired protein
      and leaves both entities in solution. The immobilized enterokinase
      resin is recharged and reused.

   c. The choice of these alternatives is made depending upon the size and
      chemical composition of the transgene target protein.

6. A final separation of either of these two (5a or 5b) protein mixtures is made
   using size exclusion, or enterokinase affinity chromatography. This step
   allows for desalting, buffer exchange and/or polishing, as needed.
Cleavage of the transgene product (ovalbumin-TAG-desired protein) by
enterokinase, then, results in two products: ovalbumin-TAG and the desired protein.
More specific methods for isolation using the TAG label are provided in the Examples.
Some desired proteins may require additions or modifications of the above-described
approach as known to one of ordinary skill in the art. The method is scalable from
the laboratory bench to pilot and production facility largely because the techniques
applied are well documented in each of these settings.
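
The outcome of the enterokinase cleavage described above can be illustrated in silico. The following is a minimal sketch only, not part of the disclosed method. It assumes one-letter amino acid codes and treats enterokinase as cutting after the Lys of each Asp-Asp-Asp-Asp-Lys motif. The strings `carrier`, `loop` and `desired` are hypothetical placeholders, not the actual ovalbumin, gp41 loop or product sequences.

```python
# Enterokinase recognition motif (Asp-Asp-Asp-Asp-Lys); cleavage occurs after Lys.
EK_SITE = "DDDDK"


def enterokinase_cleave(fusion):
    """Split a one-letter protein sequence after every DDDDK motif."""
    fragments = []
    start = 0
    pos = fusion.find(EK_SITE)
    while pos != -1:
        end = pos + len(EK_SITE)
        fragments.append(fusion[start:end])
        start = end
        pos = fusion.find(EK_SITE, start)
    if start < len(fusion):
        fragments.append(fusion[start:])
    return fragments


# Hypothetical placeholder sequences (assumptions for illustration only).
carrier = "MGSIGAASMEF"      # stand-in for the ovalbumin carrier
spacer = "PADDA" * 6         # six (Pro Ala Asp Asp Ala) repeats
loop = "X" * 15              # stand-in for the gp41 hairpin loop epitope
tag = spacer + loop + EK_SITE
desired = "FVNQHLCGSH"       # stand-in for the desired protein

fusion = carrier + tag + desired
fragments = enterokinase_cleave(fusion)
print(fragments)
```

In this sketch the first fragment is the carrier with the TAG still attached and the second is the desired protein alone, which mirrors the two products named in the text.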

It is believed that a typical chicken egg produced by a transgenic animal of the
present invention will contain at least 0.001 mg, from about 0.001 to 1.0 mg, or from
about 0.001 to 100.0 mg of exogenous protein, peptide or polypeptide, in addition to
the normal constituents of egg white (or possibly replacing a small fraction of the
latter).

One of skill in the art will recognize that after biological expression or
purification, the desired proteins, fragments thereof and peptides may possess a
conformation substantially different than the native conformations of the proteins,
fragments thereof and peptides. In this case, it is often necessary to denature and
reduce the protein and then to cause the protein to re-fold into the preferred conformation.
Methods of reducing and denaturing proteins and inducing re-folding are well known
to those of skill in the art.
Production of Protein or Peptide in Milk 
In addition to methods of producing eggs containing transgenic proteins or
peptides, the present invention encompasses methods for the production of milk
containing transgenic proteins or peptides. These methods include the administration
of a transposon-based vector described above to a mammal. In one embodiment, the
transposon-based vector contains a transposase operably-linked to a constitutive
promoter and a gene of interest operably-linked to a mammary specific promoter.
Genes of interest can include, but are not limited to, antiviral and antibacterial proteins
and immunoglobulins.

In addition to production and isolation of desired molecules, the transposon-based
vectors of the present invention can be used for the treatment of various genetic
disorders. For example, one or more transposon-based vectors can be administered to
a human or animal for the treatment of a single gene disorder including, but not
limited to, Huntington's disease, alpha-1-antitrypsin deficiency, Alzheimer's disease,
various forms of breast cancer, cystic fibrosis, galactosemia, congenital
hypothyroidism, maple syrup urine disease, neurofibromatosis 1, phenylketonuria,
sickle cell disease, and Smith-Lemli-Opitz (SLO/RSH) Syndrome. Other diseases
caused by single gene disorders that may be treated with the present invention
include autoimmune diseases, shipping fever in cattle, mastitis, bacterial or viral
diseases, and alteration of skin pigment in animals. In these embodiments, the
transposon-based vector contains a non-mutated, or non-disease causing form of the
gene known to cause such disorder. Preferably, the transposase contained within the
transposon-based vector is operably linked to an inducible promoter such as a
tissue-specific promoter such that the non-mutated gene of interest is inserted into a specific
tissue wherein the mutated gene is expressed in vivo.

In one embodiment of the present invention, a transposon-based vector
comprising a gene encoding proinsulin is administered to diabetic animals or humans
for incorporation into liver cells in order to treat or cure diabetes. The specific
incorporation of the proinsulin gene into the liver is accomplished by placing the
transposase gene under the control of a liver-specific promoter, such as G6P. This
approach is useful for treatment of both Type I and Type II diabetes. The G6P
promoter has been shown to be glucose responsive (Argaud et al., 1996, Diabetes
45:1563-1571), and thus, glucose-regulated insulin production is achieved using DNA
constructs of the present invention. Integrating a proinsulin gene into liver cells
circumvents the problem of destruction of pancreatic islet cells in the course of Type I
diabetes.

In another embodiment, shortly after diagnosis of Type I diabetes, the cells of
the immune system destroying pancreatic β-cells are selectively removed using the
transposon-based vectors of the present invention, thus allowing normal β-cells to
repopulate the pancreas.

For treatment of Type II diabetes, a transposon-based vector containing a
proinsulin gene is specifically incorporated into the pancreas by placing the
transposase gene under the control of a pancreas-specific promoter, such as an insulin
promoter. In this embodiment, the vector is delivered to a diabetic animal or human
via injection into an artery feeding the pancreas. For delivery, the vector is
complexed with a transfection agent. The artery distributes the complex throughout
the pancreas, where individual cells receive the vector DNA. Following uptake into
the target cell, the insulin promoter is recognized by transcriptional machinery of the
cell, the transposase encoded by the vector is expressed, and stable integration of the
proinsulin gene occurs. It is expected that a small percentage of the transposon-based
vector is transported to other tissues, and that these tissues are transfected. However,
these tissues are not stably transfected and the proinsulin gene is not incorporated into
the cells' DNA due to failure of these cells to activate the insulin promoter. The
vector DNA is likely lost when the cell dies or is degraded over time.

In other embodiments, one or more transposon-based vectors are administered
to an avian for the treatment of a viral or bacterial infection/disease including, but not
limited to, Colibacillosis (Coliform infections), Mycoplasmosis (CRD, Air sac,
Sinusitis), Fowl Cholera, Necrotic Enteritis, Ulcerative Enteritis (Quail disease),
Pullorum Disease, Fowl Typhoid, Botulism, Infectious Coryza, Erysipelas, Avian
Pox, Newcastle Disease, Infectious Bronchitis, Quail Bronchitis, Lymphoid Leukosis,
Marek's Disease (Visceral leukosis), and Infectious Bursal Disease (Gumboro). In these
embodiments, the transposon-based vectors may be used in a manner similar to
traditional vaccines.

In still other embodiments, one or more transposon-based vectors are
administered to an animal for the production of an animal with enhanced growth
characteristics and nutrient utilization.

The transposon-based vectors of the present invention can be used to
transform any animal cell, including but not limited to: cells producing hormones,
cytokines, growth factors, or any other biologically active substance; cells of the
immune system; cells of the nervous system; muscle (striated, cardiac, smooth) cells;
vascular system cells; endothelial cells; skin cells; mammary cells; and lung cells,
including bronchial and alveolar cells. Transformation of any endocrine cell by a
transposon-based vector is contemplated as a part of the present invention. In one
aspect of the present invention, cells of the immune system may be the target for
incorporation of a desired gene or genes encoding for production of antibodies.
Accordingly, the thymus, bone marrow, beta lymphocytes (or B cells), gastrointestinal
associated lymphatic tissue (GALT), Peyer's patches, bursa of Fabricius, lymph nodes,
spleen, and tonsil, and any other lymphatic tissue, may all be targets for
administration of the compositions of the present invention.

The transposon-based vectors of the present invention can be used to modulate
(stimulate or inhibit) production of any substance, including but not limited to a
hormone, a cytokine, or a growth factor, by an animal or a human cell. Modulation of
a regulated signal within a cell or a tissue, such as production of a second messenger,
is also contemplated as a part of the present invention. Use of the transposon-based
vectors of the present invention is contemplated for treatment of any animal or human
disease or condition that results from underproduction (such as diabetes) or
overproduction (such as hyperthyroidism) of a hormone or other endogenous
biologically active substance. Use of the transposon-based vectors of the present
invention to integrate nucleotide sequences encoding RNA molecules, such as
anti-sense RNA or short interfering RNA, is also contemplated as a part of the present
invention.

Additionally, the transposon-based vectors of the present invention may be
used to provide cells or tissues with "beacons", such as receptor molecules, for
binding of therapeutic agents in order to provide tissue and cell specificity for the
therapeutic agents. Several promoters and exogenous genes can be combined in one
vector to produce progressive, controlled treatments from a single vector delivery.

The following examples will serve to further illustrate the present invention
without, at the same time, however, constituting any limitation thereof. On the
contrary, it is to be clearly understood that resort may be had to various embodiments,
modifications and equivalents thereof which, after reading the description herein, may
suggest themselves to those skilled in the art without departing from the spirit of the
invention.

EXAMPLE 1

Preparation of Transposon-Based Vector pTnMod

A vector was designed for inserting a desired coding sequence into the
genome of eukaryotic cells, given below as SEQ ID NO:1. The vector of SEQ ID
NO:1, termed pTnMod, was constructed and its sequence verified.

This vector employed a cytomegalovirus (CMV) promoter. A modified Kozak
sequence (ACCATG) (SEQ ID NO:13) was added to the promoter. The nucleotide in
the wobble position in nucleotide triplet codons encoding the first 10 amino acids of
transposase was changed to an adenine (A) or thymine (T), which did not alter the
amino acid encoded by this codon. Two stop codons were added and a synthetic
polyA was used to provide a strong termination sequence. This vector uses a
promoter designed to be active soon after entering the cell (without any induction) to
increase the likelihood of stable integration. The additional stop codons and synthetic
polyA ensure proper termination without read through to potential genes
downstream.
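
The wobble-position change described above can be sketched in a few lines. This is an illustrative sketch only, not the procedure actually used to design the vector. It assumes the standard genetic code, keeps the first two bases of each codon, and tries A before T at the third position (an arbitrary choice). The input `example_cds` is a hypothetical coding sequence invented for the example, not the native Tn10 transposase sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq):
    """Translate a DNA coding sequence, ignoring any incomplete trailing codon."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


def at_wobble(codon):
    """Return a synonymous codon ending in A or T if one exists, else the input."""
    if codon[2] in "AT":
        return codon
    for base in "AT":  # assumption: A is tried before T
        candidate = codon[:2] + base
        if CODON_TABLE[candidate] == CODON_TABLE[codon]:
            return candidate
    return codon  # e.g. ATG (Met) has no synonymous alternative


def soften_first_codons(cds, n_codons=10):
    """Apply at_wobble to the first n_codons codons and leave the rest untouched."""
    codons = [cds[i:i + 3] for i in range(0, len(cds) - 2, 3)]
    tail = cds[len(codons) * 3:]
    head = [at_wobble(c) for c in codons[:n_codons]]
    return "".join(head + codons[n_codons:]) + tail


# Hypothetical input sequence (assumption), encoding M C E L D I L H D S L.
example_cds = "ATGTGCGAGCTCGACATCCTGCACGACTCCCTG"
modified = soften_first_codons(example_cds)
print(modified, translate(modified))
```

The encoded amino acids are unchanged while G or C at the third codon position is replaced wherever a synonymous A or T codon exists, which is the property the text relies on.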

The first step in constructing this vector was to modify the transposase to have
the desired changes. Modifications to the transposase were accomplished with the
primers High Efficiency forward primer (Hef) Altered transposase (ATS)-Hef 5'
ATCTCGAGACCATGTGTGAACTTGATATTTTACATGATTCTCTTTACC 3'
(SEQ ID NO:10) and Altered transposase-High efficiency reverse primer (Her) 5'
GATTGATCATTATCATAATTTCCCCAAAGCGTAACC 3' (SEQ ID NO:11, a
reverse complement primer). In the 5' forward primer ATS-Hef, the sequence
CTCGAG (SEQ ID NO:12) is the recognition site for the restriction enzyme Xho I,
which permits directional cloning of the amplified gene. The sequence ACCATG
(SEQ ID NO:13) contains the Kozak sequence and start codon for the transposase and
the underlined bases represent changes in the wobble position to an A or T of codons
for the first 10 amino acids (without changing the amino acid coded by the codon).
Primer ATS-Her (SEQ ID NO:11) contains an additional stop codon TAA in addition
to native stop codon TGA and adds a Bcl I restriction site, TGATCA (SEQ ID
NO:14), to allow directional cloning. These primers were used in a PCR reaction
with pTnLac (p defines plasmid, Tn defines transposon and lac defines the beta
fragment of the lactose gene, which contains a multiple cloning site) as the template
for the transposase and a FailSafe™ PCR System (which includes enzyme, buffers,
dNTPs, MgCl2 and PCR Enhancer; Epicentre Technologies, Madison, WI).
Amplified PCR product was electrophoresed on a 1% agarose gel, stained with
ethidium bromide, and visualized on an ultraviolet transilluminator. A band
corresponding to the expected size was excised from the gel and purified from the
agarose using a Zymo Clean Gel Recovery Kit (Zymo Research, Orange, CA).
Purified DNA was digested with restriction enzymes Xho I (5') and Bcl I (3') (New
England Biolabs, Beverly, MA) according to the manufacturer's protocol. Digested
DNA was purified from restriction enzymes using a Zymo DNA Clean and
Concentrator kit (Zymo Research).
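
The features attributed to the two primers can be checked with a short script. This is a hedged sketch: the primer strings below are a best reading of the printed sequences and should be treated as assumptions until compared against SEQ ID NO:10 and SEQ ID NO:11 in the sequence listing. The helper `reverse_complement` is defined here for illustration.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Primer sequences as read from the text (assumed reading; verify against the listing).
ATS_HEF = "ATCTCGAGACCATGTGTGAACTTGATATTTTACATGATTCTCTTTACC"
ATS_HER = "GATTGATCATTATCATAATTTCCCCAAAGCGTAACC"

XHO_I = "CTCGAG"        # Xho I recognition site
KOZAK_START = "ACCATG"  # modified Kozak sequence with start codon
BCL_I = "TGATCA"        # Bcl I recognition site

# Forward primer: Xho I site followed by the Kozak sequence and start codon.
xho_pos = ATS_HEF.find(XHO_I)
atg_pos = ATS_HEF.find(KOZAK_START) + 3

# Codons read in frame from the ATG; the leading codons should end in A or T
# (ATG itself cannot be changed without altering the amino acid).
codons = [ATS_HEF[i:i + 3] for i in range(atg_pos, len(ATS_HEF) - 2, 3)]
wobble_ok = all(c[2] in "AT" for c in codons[:10] if c != "ATG")

# Reverse primer: viewed on the coding strand, native TGA, added TAA, then Bcl I.
coding_view = reverse_complement(ATS_HER)
stops_then_bcl = ("TGA" + "TAA" + BCL_I) in coding_view

print(xho_pos, atg_pos, wobble_ok, stops_then_bcl)
```

Reading the reverse primer as its reverse complement makes the order of the native stop codon, the added stop codon and the Bcl I site easy to see on the coding strand.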

Plasmid pGWiz (Gene Therapy Systems, San Diego, CA) was digested with
restriction enzymes Sal I and BamH I (New England Biolabs), which are compatible
with Xho I and Bcl I, but destroy the restriction sites. Digested pGWiz was separated
on an agarose gel, the desired band excised and purified as described above. Cutting
the vector in this manner facilitated directional cloning of the modified transposase
(mATS) between the CMV promoter and synthetic polyA.
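
Why these enzyme pairs are compatible yet destroy the sites can be shown with the recognition sequences alone. The sketch below is illustrative: it encodes the well-known recognition sites and cut positions (C^TCGAG, G^TCGAC, T^GATCA, G^GATCC), and the function names `overhang` and `junction` are made up for this example.

```python
# Recognition site and top-strand cut offset for each enzyme.
ENZYMES = {
    "Xho I": ("CTCGAG", 1),
    "Sal I": ("GTCGAC", 1),
    "Bcl I": ("TGATCA", 1),
    "BamH I": ("GGATCC", 1),
}


def overhang(name):
    """Return the 5' single-stranded overhang left by the enzyme."""
    site, cut = ENZYMES[name]
    return site[cut:len(site) - cut]


def junction(left, right):
    """Sequence formed when an end cut by `left` is ligated to an end cut by `right`."""
    left_site, left_cut = ENZYMES[left]
    right_site, right_cut = ENZYMES[right]
    return left_site[:left_cut] + right_site[right_cut:]


all_sites = {site for site, _ in ENZYMES.values()}

# Vector cut with Sal I joined to insert cut with Xho I (5' side of the insert).
five_prime_junction = junction("Sal I", "Xho I")
# Insert cut with Bcl I joined to vector cut with BamH I (3' side of the insert).
three_prime_junction = junction("Bcl I", "BamH I")

print(overhang("Xho I"), overhang("Sal I"), five_prime_junction)
print(overhang("Bcl I"), overhang("BamH I"), three_prime_junction)
```

Matching overhangs allow ligation, while the hybrid junctions match none of the four recognition sites, so the ligated product cannot be re-cut by any of these enzymes at those positions.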

To insert the mATS between the CMV promoter and synthetic polyA in
pGWiz, a Stratagene T4 Ligase Kit (Stratagene, Inc., La Jolla, CA) was used and the
ligation set up according to the manufacturer's protocol. Ligated product was
transformed into E. coli Top10 competent cells (Invitrogen Life Technologies,
Carlsbad, CA) using chemical transformation according to Invitrogen's protocol.
Transformed bacteria were incubated in 1 ml of SOC (GIBCO BRL, CAT# 15544-042)
medium for 1 hour at 37° C before being spread to LB (Luria-Bertani media
(broth or agar)) plates supplemented with 100 µg/ml ampicillin (LB/amp plates).
These plates were incubated overnight at 37° C and resulting colonies picked to
LB/amp broth for overnight growth at 37° C. Plasmid DNA was isolated using a
modified alkaline lysis protocol (Sambrook et al., 1989), electrophoresed on a 1%
agarose gel, and visualized on a U.V. transilluminator after ethidium bromide
staining. Colonies producing a plasmid of the expected size (approximately 6.4 kbp)
were cultured in at least 250 ml of LB/amp broth and plasmid DNA harvested using a
Qiagen Maxi-Prep Kit (column purification) according to the manufacturer's protocol
(Qiagen, Inc., Chatsworth, CA). Column purified DNA was used as template for
sequencing to verify the changes made in the transposase were the desired changes
and no further changes or mutations occurred due to PCR amplification. For
sequencing, Perkin-Elmer's Big Dye Sequencing Kit was used. All samples were sent
to the Gene Probes and Expression Laboratory (LSU School of Veterinary Medicine)
for sequencing on a Perkin-Elmer Model 377 Automated Sequencer.

Once a clone was identified that contained the desired mATS in the correct
orientation, primers CMVf-NgoM IV (5' TTGCCGGCATCAGATTGGCTAT (SEQ
ID NO:15); underlined bases denote NgoM IV recognition site) and Syn-polyA-BstE
II (5' AGAGGTCA^GGGTCAATTCTTCAGCACCIGOIA (SEQ ID NO:16);
underlined bases denote BstE II recognition site) were used to PCR amplify the entire
CMV promoter, mATS, and synthetic polyA for cloning upstream of the transposon
in pTnLac. The PCR was conducted with FailSafe™ as described above, purified
using the Zymo Clean and Concentrator kit, the ends digested with NgoM IV and
BstE II (New England Biolabs), purified with the Zymo kit again and cloned upstream
of the transposon in pTnLac as described below.

Plasmid pTnLac was digested with NgoM IV and BstE II to remove the ptac
promoter and transposase and the fragments separated on an agarose gel. The band
corresponding to the vector and transposon was excised, purified from the agarose,
and dephosphorylated with calf intestinal alkaline phosphatase (New England
Biolabs) to prevent self-annealing. The enzyme was removed from the vector using a
Zymo DNA Clean and Concentrator-5. The purified vector and CMVp/mATS/polyA
were ligated together using a Stratagene T4 Ligase Kit and transformed into E. coli as
described above.

Colonies resulting from this transformation were screened (mini-preps) as
described above and clones that were the correct size were verified by DNA sequence
analysis as described above. The vector was given the name pTnMod (SEQ ID NO:1)
and includes the following components:

Base pairs 1-130 are a remainder of F1(-) ori from pBluescriptII sk(-)
(Stratagene), corresponding to base pairs 1-130 of pBluescriptII sk(-).

Base pairs 131 - 132 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 133 - 1777 are the CMV promoter/enhancer taken from vector
pGWiz (Gene Therapy Systems), corresponding to bp 229-1873 of pGWiz. The
CMV promoter was modified by the addition of an ACC sequence upstream of ATG.

Base pairs 1778-1779 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 1780 - 2987 are the coding sequence for the transposase, modified
from Tn10 (GenBank accession J01829) by optimizing codons for stability of the
transposase mRNA and for the expression of protein. More specifically, in each of the
codons for the first ten amino acids of the transposase, G or C was changed to A or T
when such a substitution would not alter the amino acid that was encoded.

Base pairs 2988-2993 are two engineered stop codons.

Base pair 2994 is a residue from ligation of restriction enzyme sites used in
constructing the vector.

Base pairs 2995 - 3410 are a synthetic polyA sequence taken from the pGWiz
vector (Gene Therapy Systems), corresponding to bp 1922-2337 of pGWiz.

Base pairs 3415 - 3718 are non-coding DNA that is residual from vector
pNK2859.

Base pairs 3719 - 3761 are non-coding λ DNA that is residual from pNK2859.

Base pairs 3762 - 3831 are the 70 bp of the left insertion sequence recognized
by the transposon Tn10.

Base pairs 3832-3837 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 3838 - 4527 are the multiple cloning site from pBluescriptII sk(-),
corresponding to bp 924-235 of pBluescriptII sk(-). This multiple cloning site may be
used to insert any coding sequence of interest into the vector.

Base pairs 4528-4532 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 4533 - 4602 are the 70 bp of the right insertion sequence
recognized by the transposon Tn10.

Base pairs 4603 - 4644 are non-coding λ DNA that is residual from pNK2859.

Base pairs 4645 - 5488 are non-coding DNA that is residual from pNK2859.

Base pairs 5489 - 7689 are from the pBluescriptII sk(-) base vector
(Stratagene, Inc.), corresponding to bp 761-2961 of pBluescriptII sk(-).

Completing pTnMod is a pBlueScript backbone that contains a ColE1 origin of
replication and an antibiotic resistance marker (ampicillin).

It should be noted that all non-coding DNA sequences described above can be
replaced with any other non-coding DNA sequence(s). Missing nucleotide sequences
in the above construct represent restriction site remnants.
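
A component map of this kind can be checked mechanically for unlisted stretches and overlaps. The following is a minimal sketch using the pTnMod coordinates listed above. The start of the second pNK2859 segment is taken as 4645 so that it abuts the preceding feature (an assumption of this sketch), and the short labels and the function name `find_gaps_and_overlaps` are introduced only for illustration.

```python
# (start, end, label) for each listed pTnMod component, 1-based inclusive.
FEATURES = [
    (1, 130, "F1(-) ori remainder"),
    (131, 132, "ligation residue"),
    (133, 1777, "CMV promoter/enhancer"),
    (1778, 1779, "ligation residue"),
    (1780, 2987, "transposase coding sequence"),
    (2988, 2993, "engineered stop codons"),
    (2994, 2994, "ligation residue"),
    (2995, 3410, "synthetic polyA"),
    (3415, 3718, "pNK2859 non-coding DNA"),
    (3719, 3761, "pNK2859 lambda DNA"),
    (3762, 3831, "left insertion sequence"),
    (3832, 3837, "ligation residue"),
    (3838, 4527, "multiple cloning site"),
    (4528, 4532, "ligation residue"),
    (4533, 4602, "right insertion sequence"),
    (4603, 4644, "pNK2859 lambda DNA"),
    (4645, 5488, "pNK2859 non-coding DNA"),  # start assumed to be 4645
    (5489, 7689, "pBluescriptII sk(-) base vector"),
]


def find_gaps_and_overlaps(features):
    """Return (gaps, overlaps) as lists of inclusive coordinate ranges."""
    gaps, overlaps = [], []
    ordered = sorted(features)
    for (_, end1, _), (start2, _, _) in zip(ordered, ordered[1:]):
        if start2 > end1 + 1:
            gaps.append((end1 + 1, start2 - 1))
        elif start2 <= end1:
            overlaps.append((start2, end1))
    return gaps, overlaps


gaps, overlaps = find_gaps_and_overlaps(FEATURES)
lengths = {label: end - start + 1 for start, end, label in FEATURES}
print("gaps:", gaps, "overlaps:", overlaps)
```

With these coordinates the only unlisted stretch is base pairs 3411-3414, consistent with the statement that missing nucleotides represent restriction site remnants.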

All plasmid DNA was isolated by standard procedures. Briefly, Escherichia
coli containing the plasmid was grown in 500 mL aliquots of LB broth (supplemented
with an appropriate antibiotic) at 37°C overnight with shaking. Plasmid DNA was
recovered from the bacteria using a Qiagen Maxi-Prep kit (Qiagen, Inc., Chatsworth,
CA) according to the manufacturer's protocol. Plasmid DNA was resuspended in 500
µL of PCR-grade water and stored at -20°C until used.

EXAMPLE 2 

Preparation of Transposon-Based Vector pTnMod (CMV/Red)

A vector was designed for inserting a reporter gene (DsRed) under the control
of the CMV promoter into the genome of vertebrate cells given below as SEQ ID
NO:2. The reporter gene chosen was the DsRed gene, driven by the immediate early
cytomegalovirus promoter, to produce a plasmid called pTnCMV/DsRed. The DsRed
gene product is a red fluorescent protein from an IndoPacific sea anemone,
Discosoma sp., which fluoresces bright red at 558 nm. It is to be understood that the
reporter gene, i.e., the DsRed gene, is only one embodiment of the present invention
and that any gene of interest may be inserted into the plasmid in place of the DsRed
reporter gene in any Experiment described herein.

The vector of SEQ ID NO:2, named pTnMod (CMV/Red), was constructed,
and its sequence verified by re-sequencing. SEQ ID NO:2, pTnMod (CMV/Red),
includes the following components:

Base pairs 1-130 are a remainder of F1(-) ori from pBluescriptII sk(-)
(Stratagene), corresponding to bp 1-130 of pBluescriptII sk(-).

Base pairs 131 - 132 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 133 - 1777 are the CMV promoter/enhancer taken from vector
pGWiz (Gene Therapy Systems), corresponding to bp 229-1873 of pGWiz.

Base pairs 1778-1779 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 1780 - 2987 are the coding sequence for the transposase, modified
from Tn10 (GenBank accession J01829) by optimizing codons as discussed above.

Base pairs 2988-2993 are two engineered stop codons.

Base pair 2994 is a residue from ligation of restriction enzyme sites used in
constructing the vector.

Base pairs 2995 - 3410 are a synthetic polyA sequence taken from the pGWiz
vector (Gene Therapy Systems), corresponding to bp 1922-2337 of pGWiz.

Base pairs 3415 - 3718 are non-coding DNA that is residual from vector
pNK2859.

Base pairs 3719 - 3761 are non-coding λ DNA that is residual from pNK2859.

Base pairs 3762 - 3831 are the 70 bp of the left insertion sequence recognized
by the transposon Tn10.

Base pairs 3832-3837 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 3838 - 4044 are part of the multiple cloning site from pBluescriptII
sk(-), corresponding to bp 924-718 of pBluescriptII sk(-).

Base pairs 4045-4048 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 4049-5693 are the CMV promoter/enhancer, taken from vector
pGWiz (Gene Therapy Systems), corresponding to bp 229-1873 of pGWiz.

Base pairs 5694-5701 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 5702 - 6611 are the DsRed reporter coding sequence, including
polyA sequence, from pDsRed1.1 (Clontech), corresponding to bp 77 - 992 of
pDsRed1.1.

Base pairs 6618 - 7101 are part of the multiple cloning site from pBluescriptII
sk(-), corresponding to bp 718-235 of pBluescriptII sk(-).

Base pairs 7102-7106 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 7107 - 7176 are the 70 bp of the right insertion sequence
recognized by the transposon Tn10.

Base pairs 7177 - 7218 are non-coding λ DNA that is residual from pNK2859.

Base pairs 7219 - 8062 are non-coding DNA that is residual from pNK2859.

Base pairs 8063 - 10263 are from the pBluescriptII sk(-) base vector
(Stratagene, Inc.), corresponding to bp 761-2961 of pBluescriptII sk(-).

It should be noted that all non-coding DNA sequences described above can be
replaced with any other non-coding DNA sequence(s).

EXAMPLE 3 

Preparation of Transposon-Based Vector pTnMod (Oval/Red) - Chicken

A vector was designed for inserting a reporter gene (DsRed) under the control
of the ovalbumin promoter, and including the ovalbumin signal sequence, into the
genome of a bird. One version of this vector is given below as SEQ ID NO:3. The
vector of SEQ ID NO:3, named pTnMod (Oval/Red) - Chicken, includes chicken
ovalbumin promoter and signal sequences.

SEQ ID NO:3 ? pTnMod (Oval/Red) - Chicken, includes the following 
components: 

Base pairs 1=130 are a remainder of Fl(-) on from pBluescriptll §k(~) 
(Sfratagene), corresponding to bp 1-130 of pBluescriptll sk(-), 
25 Base pairs 131 - 132 are a residue from ligation of restriction enzyme sites 

used in constructing the vector. 

Base pairs 133 -177? are the CMV promoter/enhancer taken from vector 
pGWiz (Gene Therapy Systems, corresponding to bp 229-1873 of pGWk, 

Base pairs 1 778-1779 are a residue from ligation of restriction enzyme sites 
30 used in constructing the vector. 

Base pairs 1780 - 298? are the coding sequence for the tr&nsposase, modified 
from TniO (GenBank accession JO 1829} by optimising codons as discussed above. 

Base pairs 2988-2993 are two engineered stop eodous. 
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Base pan 2994 is a residue from ligation of restriction enzyme sites used in 
constructing the vector. 

Base pairs 2995 ~ 3410 are & synthetic polyA sequence taken from the pGWte 
vector (Gene Therapy Systems), corresponding to bp 1922-2337 of pGWiz. 
5 Base pairs 3413 —3718 are non-coding DNA that is residual from vector 

pNK2859. 

Base pairs 3719 - 3761 are non-coding X DNA that is residua! from 10 
pNK2859. 

Base pairs 3762 - 3831 are the 70 bp of the left insertion sequence recognised 
1 0 by the iransposon TnlO. 

Base pairs 3832-3837 are a residue from ligation of restriction enzyme sites 
used m constructing the vector, 

Base pairs 3 83 8 - 4044 are part of the multiple cloning site from pBhiescriptll 
sk(~)„ corresponding to hp 924-718 of pBluescriptlf sk{«). 
15 Base pairs 4045*4049 are a residue from ligation of restriction enzyme sites 

used in constructing the vector, 

Base pairs 4050 - 4951 contain upstream elements of the (including SDRE, 
steroid-dependent response element). See GenBank accession number J00895 
M24999, bp 431-1332. Base pairs 4952-4959 are a residue from ligation of 
20 restriction .enzyme sites used in constructing the vector. 

Base pairs 4960 - 5112 are the chicken ovalbumin signal sequence (GenBank 
accession number J00895 M24999, bp 2996-3148). 

Base pairs 5113-31 18 are a residue from ligation of restriction ensyme sites 
used in constructing the vector. 

25 Base pairs 5119 - 6011 are the DsRsd reporter coding sequence, including 

poly A sequence, from pDsRedl.l (Clontech), corresponding to bp 100 - 992 of 
pDsRed LI. 

Base pairs 6012-601? are a residue from ligation of restriction enzyme sites 
used in constructing the vector, 
50 Base pairs 6018 - 6056 are part of the multiple cloning site of the ZeroBlunt 

Topo cloning vector (Invitrogen), corresponding to- bp 337-377 of ZeroBlunt. 

Base pairs 6057-6062 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 6063 - 6495 are part of the multiple cloning site from pBluescriptII
sk(-), corresponding to bp 667-235 of pBluescriptII sk(-).

Base pairs 6496-6500 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 6501 - 6570 are the 70 bp of the right insertion sequence
recognised by the transposon Tn10.

Base pairs 6571 - 6612 are non-coding λ DNA that is residual from pNK2859.

Base pairs 6613 - 7477 are non-coding DNA that is residual from pNK2859.

Base pairs 7478 - 9678 are from the pBluescriptII sk(-) base vector
(Stratagene, Inc.), corresponding to bp 761-2961 of pBluescriptII sk(-).

It should be noted that all non-coding DNA sequences described above can be
replaced with any other non-coding DNA sequence(s).



EXAMPLE 4 

Preparation of Transposon-Based Vector pTnMod(Oval/Red) - Quail

A vector was designed for inserting a reporter gene (DsRed) under the control
of the ovalbumin promoter, and including the ovalbumin signal sequence, into the
genome of a bird given below as SEQ ID NO:4. The vector of SEQ ID NO:4, named
pTnMod (Oval/Red) - Quail, has been constructed, and selected portions of the
sequence have been verified by re-sequencing.

SEQ ID NO:4, pTnMod (Oval/Red) - Quail, includes the following
components:

Base pairs 1-130 are a remainder of F1(-) ori from pBluescriptII sk(-)
(Stratagene), corresponding to bp 1-130 of pBluescriptII sk(-).

Base pairs 131 - 132 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 133 - 1777 are the CMV promoter/enhancer taken from vector
pGWiz (Gene Therapy Systems), corresponding to bp 229-1873 of pGWiz.

Base pairs 1778-1779 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 1780 - 2987 are the coding sequence for the transposase, modified
from Tn10 (GenBank accession J01829) by optimizing codons as discussed above.

Base pairs 2988-2993 are two engineered stop codons.

Base pair 2994 is a residue from ligation of restriction enzyme sites used in
constructing the vector.

Base pairs 2995 - 3410 are a synthetic polyA sequence taken from the pGWiz
vector (Gene Therapy Systems), corresponding to bp 1922-2337 of pGWiz.

Base pairs 3415 - 3718 are non-coding DNA that is residual from vector
pNK2859.

Base pairs 3719 - 3761 are non-coding λ DNA that is residual from pNK2859.

Base pairs 3762 - 3831 are the 70 base pairs of the left insertion sequence
recognised by the transposon Tn10.

Base pairs 3832-3837 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 3838 - 4044 are part of the multiple cloning site from pBluescriptII
sk(-), corresponding to bp 924-718 of pBluescriptII sk(-).

Base pairs 4045-4049 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 4050 - 4934 are the Japanese quail ovalbumin promoter (including
SDRE, steroid-dependent response element). The Japanese quail ovalbumin promoter
was isolated by its high degree of homology to the chicken ovalbumin promoter
(GenBank accession number J00895 M24999, base pairs 431-1332). Some deletions
were noted in the quail sequence, as compared to the chicken sequence.

Base pairs 4935-4942 are a residue from ligation of restriction enzyme sites 
used in constructing the vector. 

Base pairs 4943 - 5092 are the Japanese quail ovalbumin signal sequence. The
quail signal sequence was isolated by its high degree of homology to the chicken
signal sequence (GenBank accession number J00895 M24999, base pairs 2996-3148).
Some deletions were noted in the quail sequence, as compared to the chicken
sequence.

Base pairs 5093-5098 are a residue from ligation of restriction enzyme sites 
used in constructing the vector. 

Base pairs 5099 - 5991 are the DsRed reporter coding sequence, including
polyA sequence, from pDsRed1.1 (Clontech), corresponding to bp 100 - 992 of
pDsRed1.1.

Base pairs 5992-5997 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 5998 - 6036 are part of the multiple cloning site of the ZeroBlunt
Topo cloning vector (Invitrogen), corresponding to base pairs 337-377 of ZeroBlunt.

Base pairs 6037-6042 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 6043 - 6475 are part of the multiple cloning site from pBluescriptII
sk(-), corresponding to bp 667-235 of pBluescriptII sk(-).

Base pairs 6476-6480 are a residue from ligation of restriction enzyme sites
used in constructing the vector.

Base pairs 6481 - 6550 are the 70 bp of the right insertion sequence
recognised by the transposon Tn10.

Base pairs 6551 - 6592 are non-coding λ DNA that is residual from pNK2859.

Base pairs 6593 - 7457 are non-coding DNA that is residual from pNK2859.

Base pairs 7458 - 9658 are from the pBluescriptII sk(-) base vector
(Stratagene, Inc.), corresponding to base pairs 761-2961 of pBluescriptII sk(-).

It should be noted that all non-coding DNA sequences described above can be 
replaced with any other non-coding DNA sequence(s). 
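
As an illustrative cross-check, not part of the original disclosure, the
component coordinates listed for SEQ ID NO:4 can be compared with the
source-vector ranges they are said to correspond to. The Python sketch below
assumes the coordinates as transcribed from the list above; the names
COMPONENTS, span, and length_mismatches are hypothetical helpers introduced only
for this illustration.

```python
# Each entry: (label, vector_start, vector_end, source_start, source_end).
# Coordinates are transcribed from the component list for SEQ ID NO:4;
# only entries that state a corresponding source range are included.
COMPONENTS = [
    ("CMV promoter/enhancer", 133, 1777, 229, 1873),
    ("synthetic polyA", 2995, 3410, 1922, 2337),
    ("pBluescriptII MCS (left)", 3838, 4044, 924, 718),
    ("DsRed reporter", 5099, 5991, 100, 992),
    ("ZeroBlunt MCS", 5998, 6036, 337, 377),
    ("pBluescriptII MCS (right)", 6043, 6475, 667, 235),
    ("pBluescriptII base vector", 7458, 9658, 761, 2961),
]


def span(start, end):
    """Inclusive length of a range; also handles reversed ranges such as 924-718."""
    return abs(end - start) + 1


def length_mismatches(components):
    """Return (label, vector_len, source_len) for entries whose lengths differ."""
    mismatches = []
    for label, v_start, v_end, s_start, s_end in components:
        vector_len = span(v_start, v_end)
        source_len = span(s_start, s_end)
        if vector_len != source_len:
            mismatches.append((label, vector_len, source_len))
    return mismatches


if __name__ == "__main__":
    for label, vector_len, source_len in length_mismatches(COMPONENTS):
        print(f"{label}: {vector_len} bp in vector, {source_len} bp in source range")
```

With the coordinates as transcribed, every listed pair agrees in length except
the ZeroBlunt entry (39 bp in the vector versus 41 bp in the stated source
range), which may reflect a slip in one of the two ranges.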

EXAMPLE 5

Transfection of Stage X Japanese Quail Eggs with pTnMod(Oval/Red) - Quail via
embryo injection

Transgenic Japanese quail were produced by transfecting Stage X embryos,
and the heritability of the transgene delivered by embryo transfection was established.
More specifically, fertile eggs were collected in the morning and placed at 15° C until
enough were collected for injection, but were held no longer than 7 days. Stage X
embryos (eggs) were assigned to one of two treatment groups. Before treatment, each
egg was incubated on its side at room temperature for about 2 hours to allow the
embryo to move to "top dead center" (TDC). Each egg was transfected by drilling a 1
mm hole (directly above the embryo) through the shell without penetrating the
underlying shell membrane. A 0.5 ml syringe fitted with a 28 gauge needle was used
to deliver DNA complexed to a transfecting reagent, i.e. SUPERFECT®, in a 50 µl
volume. An adhesive disc was used to seal the hole and provide a label for treatment
identification. After all eggs were transfected, they were set in an incubator with the
adhesive disc pointing upward for hatching.

Each bird that hatched was bled at one week of age, DNA was extracted from
blood cells, and PCR was conducted using 28s primers as a positive control and
primers specific to DsRed. Any bird that was negative was terminated, while positive
birds were monitored to determine maintenance of the transgene. Birds consistently
positive were maintained until sexual maturity and bred. Positive male and female
birds were mated. The eggs of mated hens were hatched and the resulting chicks, the
G1 generation, were evaluated to determine if they were transgenic. All G1s resulting
from this mating were bled and PCR conducted as described above.

Egg injection: Two treatment groups and one control group were used for this
experiment. Vector pTnMod (Oval/Red) in supercoiled form (Treatment 1) and in
linear form (Treatment 2) were used to transfect 15 eggs per treatment. To obtain
linear DNA for this experiment, pTnMod (Oval/Red) was digested with NgoM IV,
column purified, and resuspended in TE buffer.

Each egg was injected with 0.75 µg of DNA complexed with SUPERFECT®
in a 1:3 ratio in a total injection volume of 50 µl. Hank's Balanced Salt Solution
(HBSS) was used to bring the volume to 50 µl. The DNA-SUPERFECT® mixture must be
allowed to incubate (for complex formation) at room temperature for 10 minutes prior
to injection and must be used within 40 minutes post initial mixing. Eggs were
incubated as described above after injection.
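
The per-egg mixture described above can be worked through numerically. The
following Python sketch is illustrative only: it assumes that the 1:3 ratio
means 3 µl of reagent per µg of DNA (the text does not state the units for this
Example) and uses a hypothetical DNA stock concentration of 1 µg/µl. The
function and parameter names are introduced here for illustration.

```python
def injection_mix(dna_ug, reagent_ul_per_ug, total_ul, dna_stock_ug_per_ul):
    """Return the volumes (in µl) of DNA stock, reagent, and HBSS for one egg.

    dna_stock_ug_per_ul is a hypothetical stock concentration; the text gives
    only the DNA mass (0.75 µg) and the total volume (50 µl).
    """
    dna_ul = dna_ug / dna_stock_ug_per_ul
    reagent_ul = dna_ug * reagent_ul_per_ug
    hbss_ul = total_ul - dna_ul - reagent_ul
    if hbss_ul < 0:
        raise ValueError("DNA and reagent volumes exceed the total injection volume")
    return {"dna_ul": dna_ul, "reagent_ul": reagent_ul, "hbss_ul": hbss_ul}


def within_use_window(minutes_since_mixing, min_incubation=10, max_age=40):
    """True if the mixture has incubated at least 10 min and is at most 40 min old."""
    return min_incubation <= minutes_since_mixing <= max_age


if __name__ == "__main__":
    mix = injection_mix(dna_ug=0.75, reagent_ul_per_ug=3.0,
                        total_ul=50.0, dna_stock_ug_per_ul=1.0)
    print(mix)
    print(within_use_window(25))
```

Under these assumptions the HBSS volume is simply whatever remains after the
DNA and reagent volumes are subtracted from 50 µl.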

Results: In the supercoiled injection group, 2 females and 1 male were
identified as PCR positive using primers specific to the DsRed coding sequence.
These birds were mated as described above. Blood was taken from the G1 chicks and
PCR was conducted. The results showed that the transgene was incorporated into the
gametes of these birds. The G1 chicks from these birds were examined on a weekly
basis until it was verified that the gene was not present or enough transgenic G1s were
obtained to initiate a breeding flock of fully transgenic birds. Eggs from these G1
chicks expressed DsRed protein in the albumin portion of their eggs.

EXAMPLE 6 

Intratesticular Injection of Chickens with pTnMod(CMV/Red) (SEQ ID NO:2)

Immature birds of different ages (4, 6, 8, 10, 12, and 14 weeks) were placed
under anesthesia and injected in the testes with the construct pTnMod(CMV/Red). The
injected material was a saline solution containing 1-5 µg of purified DNA vector,
mixed with SUPERFECT® transfecting reagent (Qiagen, Valencia, CA) in a 1:6 (wt:vol)
ratio. The volume of saline was adjusted so that the total volume injected into each
testis was 150-200 µl, depending on the age and size of the bird. For the 4- and
6-week-old chickens, 1 µg DNA in 150 µl was injected in each testis, divided into
three doses of 50 µl each. For the older birds, 200 µl total volume was injected,
containing either 3 µg DNA (for 8-week-old birds) or 5 µg DNA (for older birds) per
testis. First, one testis was surgically exposed prior to injection. After injection,
the incision was sutured, and the sequence was repeated for the alternate testis.
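
The age-dependent dosing just described can be tabulated. The Python sketch
below is an illustration under stated assumptions: the dose table is read from
the paragraph above, and every age is assumed to receive its volume in three
equal injections (the text states three 50 µl doses only for the 4- and
6-week-old birds). The names DOSE_BY_AGE_WEEKS and injection_plan are
hypothetical.

```python
# age in weeks -> (µg DNA per testis, total µl per testis), as read from the text
DOSE_BY_AGE_WEEKS = {
    4: (1.0, 150.0),
    6: (1.0, 150.0),
    8: (3.0, 200.0),
    10: (5.0, 200.0),
    12: (5.0, 200.0),
    14: (5.0, 200.0),
}

SITES_PER_TESTIS = 3  # assumption: three equal injections at every age


def injection_plan(age_weeks):
    """Return dose, volume, per-site volume, and DNA concentration for one testis."""
    dna_ug, volume_ul = DOSE_BY_AGE_WEEKS[age_weeks]
    return {
        "dna_ug": dna_ug,
        "volume_ul": volume_ul,
        "per_site_ul": volume_ul / SITES_PER_TESTIS,
        "conc_ug_per_ul": dna_ug / volume_ul,
    }


if __name__ == "__main__":
    for age in sorted(DOSE_BY_AGE_WEEKS):
        plan = injection_plan(age)
        print(age, round(plan["per_site_ul"], 1), round(plan["conc_ug_per_ul"], 4))
```

Dividing 200 µl over three sites gives roughly 67 µl per site, which falls
within the 50 - 70 µl per-site range given later in this Example.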

From six to nine months post-surgery, weekly sperm samples were taken from
each injected bird, as well as from control birds. Each sperm sample was evaluated for
uptake and expression of the injected gene. Samples were evaluated by PCR on whole
sperm, within one week after collection.

Approximately 100 male white leghorn chickens, in groups of 5-26, at ages 4,
6, 8, 10, 12, and 14 weeks, were used, as this is the age range in which it is expected
that the testes are likely to be most "receptive." In this age range, the blood/testis
barrier has not yet formed, and there is a relatively high number of spermatogonia
relative to the numbers of other cell types, e.g., spermatids, etc. See J. Kumaran et al.,
1949. Poultry Sci., vol. 29, pp. 511-520. See also E. Oakberg, 1956, Am. J. Anatomy,
vol. 99, pp. 507-515; and P. Kluin et al., 1984. Anat. Embryol., vol. 169, pp. 73-78.

The experimental and control males were obtained from commercial sources
at one day of age, and maintained in brooders until used. The male birds were housed
in temperature-controlled spaces in individual standard caging as they approached
maturity. They were given water and standard commercial feed ad lib. They were
kept initially in a 23:1 hour light/dark cycle, stepped down at approximately weekly
intervals to a 15:8 hour light/dark cycle, as this regimen has been reported to optimise
sexual maturity and fertility.

At the appropriate ages, groups of individual males were starved overnight and
then subjected to transgene delivery by direct intratesticular injection of DNA by
experienced animal surgeons. Each male was anesthetized with isoflurane via a
simplified gas machine.

Various devices and anesthesia machines have previously been described for
administering isoflurane (and other gaseous anesthetics) to birds. See Alsage et al.,
Poultry Sci., 50:1876-1878 (1971); Greenlees et al., Am. J. Vet. Res., vol. 51, pp.
757-758 (1990). However, these prior techniques are somewhat cumbersome and
complex to implement. A novel and much simpler system to administer isoflurane (or
other gaseous) anesthesia was developed due to the deficiencies in the prior art, a
system that we found worked well on all ages of chicks. A standard nose cone was
placed over the chick's head, similar to the system that has been used for decades to
administer ether to mice. A plastic tube approximately 3.5 cm in diameter and 12 cm
long was filled with cotton, into which was poured approximately 2 ml isoflurane
(Abbott Laboratories, Chicago). The chick's head was placed partially into the
cylinder, and was held in place there intermittently throughout the surgery as required
to maintain the proper plane of anesthesia, without overdosing.

Each anesthetized bird was positioned on its side on an animal board with
cords tractioning the wings and feet to allow access to the testes area. The area was
swabbed with 0.5% chlorhexidine, and a 2 cm dorsolateral incision was made in the
skin over the testis (similar to the procedure commonly used for caponization). A
small-animal retractor was used to spread the last two ribs, exposing the testis. The
DNA solution was then mixed with SUPERFECT® (Qiagen) according to the
manufacturer's protocol, approximately a 1:6 wt/vol ratio, to a final concentration of
0.01 - 0.05 µg/µl. This resulted in 1 - 5 µg total DNA (in a 150-200 µl volume) being
injected into each testis, spread over three injection sites: one at each end of the testis,
and one in the middle.

The injection device was a standard 25 gauge, 1/2 inch (1.27 cm) hypodermic
needle, attached to a 50, 100, or 200 µl syringe. Approximately 5 mm of the needle
tip was bent at a 90 degree angle, to facilitate insertion into the testes. Approximately
50 - 70 µl of the DNA-SUPERFECT® solution was injected into each of three sites
per testis. The multiple injections were calculated to suffuse the DNA throughout the
whole testis, the idea being to promote contact between DNA and spermatogonia as
much as feasible. We estimated that our procedure resulted in the injection of about
100,000 DNA molecules per spermatogonium. The construct used in these tests was
a highly potent constitutive modified CMV promoter, operatively linked to the dsRed
gene as shown in SEQ ID NO:2.

Following injection, the incision was closed in two layers with 4-0 absorbable
suture, and then the contralateral testis was similarly exposed and injected. Following
surgery, each bird was returned to its cage to recover. One hundred thirteen males
were ultimately used in the experimental regimen to increase the overall likelihood of
success, along with 4 control birds (16 weeks old) subjected to sham surgery (with
injections containing only the transfection reagent).

Thus, a total of 113 white leghorn chickens were injected with the DNA vector
in groups of 5-26 at varying ages. Fourteen birds were transformed at 4 weeks; 23
birds at 6 weeks; 26 birds at 8 weeks; 23 birds at 10 weeks; 5 birds at 12 weeks; and
22 birds at 14 weeks. Sixteen birds died before they could be sampled, so to date, 97
roosters have been sampled, plus the four controls. Birds were evaluated at 18-24
weeks of age for (a) potential transformation in the sperm, and (b) successful testis
transfection. Sperm samples were obtained from each rooster by manual
manipulation using standard techniques. The sperm were washed, and their DNA was
extracted following the techniques of G. Mann et al., 1993. J. Reprod. Fert., 99:505-12.
The samples were then frozen until analyzed. Evaluation was conducted by PCR
analysis to detect DNA integration into the sperm, or into any of the testicular cells.
Additionally, selected testes were harvested at the end of the sperm sampling period.

Of 97 birds tested, at least 22 showed probable positive results. Positive
results were observed at all transformation ages, except for 4 weeks, which was not
tested. At least two birds were confirmed positive by PCR of sperm, conducted four
months after the initial injection. These results were transient in many cases, however,
since it was believed that the DsRed gene product used in these initial proof of
concept experiments was toxic. Nevertheless, the positive PCR results presumptively
demonstrated that the transgene was incorporated into spermatogonia (before
puberty), and that it was carried in transgenic sperm. Such sperm could then transmit
the gene to subsequent generations, resulting in the production of true, germ-line
transgenic "founder" birds.
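
The bird counts reported above can be reconciled with a short tally. This
Python sketch is illustrative only; the figures are taken from the text, and
because the text reports "at least 22" probable positives, the computed fraction
is a lower bound. The names used are hypothetical.

```python
# Birds injected per age group (weeks -> count), as reported in the text.
INJECTED_BY_AGE_WEEKS = {4: 14, 6: 23, 8: 26, 10: 23, 12: 5, 14: 22}
DIED_BEFORE_SAMPLING = 16
PROBABLE_POSITIVES = 22  # reported as "at least 22"


def summarize(injected_by_age, died, positives):
    """Return (total injected, total sampled, minimum positive fraction)."""
    total = sum(injected_by_age.values())
    sampled = total - died
    return total, sampled, positives / sampled


if __name__ == "__main__":
    total, sampled, fraction = summarize(
        INJECTED_BY_AGE_WEEKS, DIED_BEFORE_SAMPLING, PROBABLE_POSITIVES
    )
    print(f"injected={total}, sampled={sampled}, positive>={fraction:.1%}")
```

The per-age counts sum to the 113 birds injected, and subtracting the 16 deaths
gives the 97 roosters sampled.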

To further confirm that the DNA had been incorporated into the sperm, and
that contaminating vector was not being detected from other sources, it was confirmed
through PCR on sperm of experimental birds, and on positive and negative controls,
that the sperm of the experimental birds lacked DNA encoding the transposase. The
design of the preferred transposon-based vector is such that the sequence encoding the
transposase is contained in the vector, but is not incorporated into the transformed
chromosome. Thus, presence of the exogenous coding sequence, coupled with
absence of the transposase gene, is strong evidence for incorporation of the exogenous
coding sequence, or transgene.

These results demonstrated proof of concept, as positive PCR results were
obtained from the sperm of treated birds. Interpretation of these preliminary results
was made more difficult by the fact that the modified CMV promoter used in the
experiment was probably too "hot." As the DsRed product is not secreted from the
cells, the product built up intracellularly to levels that were toxic, frequently killing
the cells. Even this result, of course, means that the transformation was successful.
The transgene could not have killed the cells otherwise.

To resolve the problem with toxicity of the DsRed gene product,
experiments were conducted using a different reporter gene operably linked to the
ovalbumin promoter, so that the transgene was expressed in the egg white. These
experiments are provided in Examples 12-15 below.
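
The PCR reasoning used in this Example (exogenous coding sequence detected,
transposase coding sequence absent) can be expressed as a small decision
function. The Python sketch below is illustrative: only the first outcome is
drawn from the text, while the wording of the other three outcomes is an
assumption added to make the function total. The function name is hypothetical.

```python
def interpret_sperm_pcr(transgene_detected: bool, transposase_detected: bool) -> str:
    """Classify a sperm PCR result from two amplification outcomes."""
    if transgene_detected and not transposase_detected:
        # The case described in the text as strong evidence for incorporation.
        return "consistent with incorporated transgene"
    if transgene_detected and transposase_detected:
        # Assumed label: intact vector may still be present in the sample.
        return "possible residual vector; incorporation not established"
    if transposase_detected:
        # Assumed label: not a combination discussed in the text.
        return "unexpected: transposase without transgene"
    return "no transgene detected"


if __name__ == "__main__":
    print(interpret_sperm_pcr(True, False))
```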



EXAMPLE 7 

Transfection of Male White Leghorn Chickens Using the Vector pTnMod(Oval/Red) -
Quail (SEQ ID NO:4) via Testicular Injections

In further experiments conducted on leghorn chickens, it was demonstrated
that chickens injected intratesticularly at 8, 10, 12, or 14 weeks of age had, on
average, approximately 40% positive sperm between 6 and 8 months after injection.
In other experiments, successful transfection was achieved with chickens injected at
13 weeks of age.

Forty-nine white leghorn roosters approximately 8, 10, 12, or 14 weeks of age
were obtained and housed. Birds were identified, wing banded, and assigned to a
treatment group. If appropriate (based on testes size and vascularization), one testis
was caponized and the entire DNA injection volume was delivered to the remaining
testis. Thirty-two males received DNA injections of 5 µg DNA/testis at a 1:3 ratio of
DNA to SUPERFECT®. The remaining birds were used as controls. After injection,
all birds were mated with at least 5 females and observed until sexual maturity and
egg-laying began. All eggs collected prior to peak egg production (approximately 24
weeks of age for the hens) were incubated and candled to determine embryo presence.
Any embryos identified were incubated to hatch to extract DNA, PCR was conducted,
and transgene presence was determined.

Roosters positive for the pTnMod(Oval/Red) - Quail construct were kept to
produce F1 offspring (eggs collected at peak production). Offspring from this hatch
were bled, DNA extracted from the blood, and PCR conducted using primers specific
for the DsRed gene. It was determined that 77% of the offspring were transgenic.

EXAMPLE 8 

Transfection of Mature Male Japanese Quail using the vector pTnMod(Oval/Red) -
Quail (SEQ ID NO:4) via Testicular Injections

Twelve sexually mature males (at approximately 13 weeks of age) underwent
surgery for testicular injection as described above for chickens. At 21-28 days of age,
the birds were identified, leg banded, debeaked, and separated based on sex.
Injections comprised 5 µg/testes of the vector in concentrations 1:3 or 1:10 for
SUPERFECT® or a 1:1 ratio with Mirus. The study consisted of 3 treatment groups
with 5 males in the 1:3 DNA:SUPERFECT® group, 3 males in the 1:10
DNA:SUPERFECT® group, and 4 males in the 1:1 Mirus group. All surgeries were
conducted in one day.

Any unincorporated DNA was allowed to clear from the testes by holding the
birds for 19 days before mating with females. At 15 weeks of age, 2 age-matched
females were housed with each treated male. The presence of the transfected DNA
was determined in the fertilized eggs during the second week of egg lay. The
subsequent eggs collected from parents producing positively identified transgenic
eggs were collected and stored until taken to hatch.

PCR performed on the sperm of quail injected at three months of age indicated
successful incorporation of the DsRed transgene into the quail sperm.

EXAMPLE 9 

Transfection of Immature Male Japanese Quail using the vector pTnMod(Oval/Red) -
Quail (SEQ ID NO:4) via Testicular Injections

Approximately 450 quail eggs were set and hatched. At 21-28 days of age, the
birds were identified, wing banded, debeaked, and separated based on sex. At 4 weeks
of age, 65 male birds underwent surgery and testicular injections as described above.
Injections comprised a control and 2 µg/testes of the vector in varying concentrations
(0, 1/3, 1/5, and 1/10) of three different transfection reagents: 1) SUPERFECT®, 2)
Mirus/Panvera and 3) Dosper. The study comprised 13 treatment groups with 5 males
per group. One transfection reagent was administered per day.

At 7 weeks of age, 2 age-matched females were housed with each treated
male. The presence of the transfected DNA was determined in the fertilized eggs
during the second week of egg lay. The subsequent eggs collected from parents
producing positively identified transgenic eggs were collected and stored until taken
to hatch. PCR performed on the sperm of quail injected at four and five weeks of age
indicated successful incorporation of the DsRed transgene into the quail sperm.
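
One way to arrive at the 13 treatment groups of this Example is to cross the
three transfection reagents with the four listed concentrations and add a single
control group. The Python sketch below enumerates the groups under that reading,
which is an assumption: the text does not spell out how the groups were formed.
The names REAGENTS, CONCENTRATIONS, treatment_groups, and total_males are
hypothetical.

```python
from itertools import product

REAGENTS = ["SUPERFECT", "Mirus/Panvera", "Dosper"]
CONCENTRATIONS = ["0", "1/3", "1/5", "1/10"]
MALES_PER_GROUP = 5


def treatment_groups():
    """Enumerate groups as one control plus every reagent/concentration pair."""
    groups = [("control", None)]
    groups.extend(product(REAGENTS, CONCENTRATIONS))
    return groups


def total_males():
    """Number of males needed if every group holds MALES_PER_GROUP birds."""
    return len(treatment_groups()) * MALES_PER_GROUP


if __name__ == "__main__":
    for group in treatment_groups():
        print(group)
    print(total_males())
```

Under this reading, 13 groups of 5 males account for the 65 birds that underwent
surgery.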



EXAMPLE 10 

Preparation of Transposon-Based Vector pTnMod(Oval/ENT TAG/p146/PA) -
Chicken

A vector is designed for inserting a p146 gene under the control of a chicken
ovalbumin promoter, and an ovalbumin gene including an ovalbumin signal sequence,
into the genome of a bird given below as SEQ ID NO:29.

Base pairs 1 - 130 are a remainder of F1(-) ori of pBluescriptII sk(-)
(Stratagene) corresponding to base pairs 1-130 of pBluescriptII sk(-).

Base pairs 133 - 1777 are a CMV promoter/enhancer taken from vector
pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.

Base pairs 1780 - 2987 are a transposase, modified from Tn10 (GenBank
accession number J01829).

Base pairs 2988-2993 are an engineered stop codon.

Base pairs 2995 - 3410 are a synthetic polyA from pGWiz (Gene Therapy
Systems) corresponding to base pairs 1922-2337 of pGWiz.

Base pairs 3415 - 3718 are non coding DNA that is residual from vector
pNK2859.

Base pairs 3719 - 3761 are λ DNA that is residual from pNK2859.

Base pairs 3762 - 3831 are the 70 base pairs of the left insertion sequence
(IS10) recognized by the transposon Tn10.

Base pairs 3838 - 4044 are a multiple cloning site from pBluescriptII sk(-)
corresponding to base pairs 924-718 of pBluescriptII sk(-).

Base pairs 4050 - 4951 are a chicken ovalbumin promoter (including SDRE)
that corresponds to base pairs 431-1332 of the chicken ovalbumin promoter in
GenBank Accession Number J00895 M24999.

Base pairs 4958 - 6115 are a chicken ovalbumin signal sequence and
ovalbumin gene that correspond to base pairs 66-1223 of GenBank Accession
Number V00383.1 (the STOP codon being omitted).

Base pairs 6122 - 6271 are a TAG sequence containing a gp41 hairpin loop
from HIV I, an enterokinase cleavage site and a spacer (synthetic).

Base pairs 6272 - 6316 are a p146 sequence (synthetic) with 2 added stop
codons.

Base pairs 6324 - 6676 are a synthetic polyadenylation sequence from pGWiz
(Gene Therapy Systems) corresponding to base pairs 1920 - 2272 of pGWiz.

Base pairs 6682 - 7114 are a multiple cloning site from pBluescriptII sk(-)
corresponding to base pairs 667-235 of pBluescriptII sk(-).

Base pairs 7120 - 7189 are the 70 base pairs of the right insertion sequence
(IS10) recognized by the transposon Tn10.

Base pairs 7190 - 7231 are λ DNA that is residual from pNK2859.

Base pairs 7232 - 8096 are non coding DNA that is residual from pNK2859.

Base pairs 8097 - 10297 are pBluescriptII sk(-) base vector (Stratagene, Inc.)
corresponding to base pairs 761-2961 of pBluescriptII sk(-).



It should be noted that all non-coding DNA sequences described above can be
replaced with any other non-coding DNA sequence(s). Missing nucleotide sequences
in the above construct represent restriction site remnants.
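
The positions not assigned to any component, which the text attributes to
restriction site remnants, can be located by scanning the component list for
SEQ ID NO:29. The Python sketch below is illustrative and assumes the
coordinates as transcribed from the list above; the names SEGMENTS and find_gaps
are hypothetical.

```python
# Inclusive (start, end) coordinates of the components listed for SEQ ID NO:29.
SEGMENTS = [
    (1, 130), (133, 1777), (1780, 2987), (2988, 2993), (2995, 3410),
    (3415, 3718), (3719, 3761), (3762, 3831), (3838, 4044), (4050, 4951),
    (4958, 6115), (6122, 6271), (6272, 6316), (6324, 6676), (6682, 7114),
    (7120, 7189), (7190, 7231), (7232, 8096), (8097, 10297),
]


def find_gaps(segments):
    """Return inclusive (start, end) ranges left uncovered between consecutive segments."""
    ordered = sorted(segments)
    gaps = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps


if __name__ == "__main__":
    for start, end in find_gaps(SEGMENTS):
        print(f"{start}-{end}: {end - start + 1} bp unassigned")
```

Each reported range is a candidate restriction site remnant under the statement
above; the sketch does not inspect the sequence itself.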



EXAMPLE 11

Preparation of Transposon-Based Vector pTnMod(Oval/ENT TAG/p146/PA) - Quail

A vector is designed for inserting a p146 gene under the control of a quail
ovalbumin promoter, and an ovalbumin gene including an ovalbumin signal sequence,
into the genome of a bird given below as SEQ ID NO:30.

Base pairs 1 - 130 are a remainder of F1(-) ori of pBluescriptII sk(-)
(Stratagene) corresponding to base pairs 1-130 of pBluescriptII sk(-).

Base pairs 133 - 1777 are a CMV promoter/enhancer taken from vector
pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.

Base pairs 1780 - 2987 are a transposase, modified from Tn10 (GenBank
accession number J01829).

Base pairs 2988-2993 are an engineered stop codon.

Base pairs 2995 - 3410 are a synthetic polyA from pGWiz (Gene Therapy
Systems) corresponding to base pairs 1922-2337 of pGWiz.

Base pairs 3415 - 3718 are non coding DNA that is residual from vector
pNK2859.

Base pairs 3719 - 3761 are λ DNA that is residual from pNK2859.

Base pairs 3762 - 3831 are the 70 base pairs of the left insertion sequence
(IS10) recognized by the transposon Tn10.

Base pairs 3838 - 4044 are a multiple cloning site from pBluescriptII sk(-)
corresponding to base pairs 924-718 of pBluescriptII sk(-).

Base pairs 4050 - 4938 are the Japanese quail ovalbumin promoter (including
SDRE, steroid-dependent response element). The Japanese quail ovalbumin promoter
was isolated by its high degree of homology to the chicken ovalbumin promoter
(GenBank accession number J00895 M24999, base pairs 431-1332).

Base pairs 4945 - 6092 are a quail ovalbumin signal sequence and ovalbumin gene
that corresponds to base pairs 54 - 1201 of GenBank accession number X53964.1
(the STOP codon being omitted).

Base pairs 6097 - 6246 are a TAG sequence containing a gp41 hairpin loop
from HIV I, an enterokinase cleavage site and a spacer (synthetic).

Base pairs 6247 - 6291 are a p146 sequence (synthetic) with 2 added stop
codons.

Base pairs 6299 - 6651 are a synthetic polyadenylation sequence from pGWiz
(Gene Therapy Systems) corresponding to base pairs 1920 - 2272 of pGWiz.

Base pairs 6657 - 7089 are a multiple cloning site from pBluescriptII sk(-)
corresponding to base pairs 667-235 of pBluescriptII sk(-).

Base pairs 7095 - 7164 are the 70 base pairs of the right insertion sequence
(IS10) recognised by the transposon Tn10.

Base pairs 7165 - 7206 are λ DNA that is residual from pNK2859.

Base pairs 7207 - 8071 are non coding DNA that is residual from pNK2859.

Base pairs 8072 - 10272 are pBluescriptII sk(-) base vector (Stratagene, Inc.)
corresponding to base pairs 761-2961 of pBluescriptII sk(-).

It should be noted that all non-coding DNA sequences described above can be
replaced with any other non-coding DNA sequence(s). Missing nucleotide sequences
in the above construct represent restriction site remnants.



EXAMPLE 12

Preparation of Transposon-Based Vector pTnMod(Oval/ENT TAG/ProIns/PA) -
Chicken

A vector is designed for inserting a proinsulin gene under the control of a
chicken ovalbumin promoter, and an ovalbumin gene including an ovalbumin signal
sequence, into the genome of a bird given below as SEQ ID NO:31.

Base pairs 1 - 130 are a remainder of F1(-) ori of pBluescriptII sk(-)
(Stratagene) corresponding to base pairs 1-130 of pBluescriptII sk(-).

Base pairs 133 - 1777 are a CMV promoter/enhancer taken from vector
pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.

Base pairs 1780 - 2987 are a transposase, modified from Tn10 (GenBank
accession number J01829).

Base pairs 2988-2993 are an engineered stop codon.

Base pairs 2995 - 3410 are a synthetic polyA from pGWiz (Gene Therapy
Systems) corresponding to base pairs 1922-2337 of pGWiz.

Base pairs 3415 - 3718 are non coding DNA that is residual from vector
pNK2859.

Base pairs 3719 - 3761 are λ DNA that is residual from pNK2859.

Base pairs 3762 - 3831 are the 70 base pairs of the left insertion sequence
(IS10) recognized by the transposon Tn10.

Base pairs 3838 - 4044 are a multiple cloning site from pBluescriptII sk(-)
corresponding to base pairs 924-718 of pBluescriptII sk(-).

Base pairs 4050 - 4951 are a chicken ovalbumin promoter (including SDRE)
that corresponds to base pairs 431-1332 of the chicken ovalbumin promoter in
GenBank Accession Number J00895 M24999.

Base pairs 4958 - 6115 are a chicken ovalbumin signal sequence and
ovalbumin gene that correspond to base pairs 66-1223 of GenBank Accession
Number V00383.1 (the STOP codon being omitted).

Base pairs 6122 - 6271 are a TAG sequence containing a gp41 hairpin loop
from HIV I, an enterokinase cleavage site and a spacer (synthetic).

Base pairs 6272 - 6531 are a proinsulin gene.

Base pairs 6539 - 6891 are a synthetic polyadenylation sequence from pGWiz
(Gene Therapy Systems) corresponding to base pairs 1920 - 2272 of pGWiz.

Base pairs 6897 - 7329 are a multiple cloning site from pBluescriptII sk(-)
corresponding to base pairs 667-235 of pBluescriptII sk(-).

Base pairs 7335 - 7404 are the 70 base pairs of the right insertion sequence
(IS10) recognized by the transposon Tn10.

Base pairs 7405 - 7446 are λ DNA that is residual from pNK2859.

Base pairs 7447 - 8311 are non coding DNA that is residual from pNK2859.

Base pairs 8312 - 10512 are pBluescriptII sk(-) base vector (Stratagene, Inc.)
corresponding to base pairs 761-2961 of pBluescriptII sk(-).

It should be noted that all non-coding DNA sequences described above can be
replaced with any other non-coding DNA sequence(s). Missing nucleotide sequences
in the above construct represent restriction site remnants.

EXAMPLE 13 

Preparation of Transposon-Based Vector pTnMod(Oval/ENT TAG/ProIns/PA) -
Quail

A vector is designed for inserting a proinsulin gene under the control of a
chicken ovalbumin promoter, and an ovalbumin gene including an ovalbumin signal
sequence, into the genome of a bird given below as SEQ ID NO:32.

Base pairs 1 - 130 are a remainder of F1(-) ori of pBluescriptII sk(-)
(Stratagene) corresponding to base pairs 1-130 of pBluescriptII sk(-).

Base pairs 133 - 1777 are a CMV promoter/enhancer taken from vector
pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.

Base pairs 1780 - 2987 are a transposase, modified from Tn10 (GenBank
accession number J01829).

Base pairs 2988-2993 are an engineered stop codon.

Base pairs 2995 - 3410 are a synthetic polyA from pGWiz (Gene Therapy
Systems) corresponding to base pairs 1922-2337 of pGWiz.

Base pairs 3415 - 3718 are non coding DNA that is residual from vector
pNK2859.

Base pairs 3719 - 3761 are λ DNA that is residual from pNK2859.

Base pairs 3762 - 3831 are the 70 base pairs of the left insertion sequence
(IS10) recognized by the transposon Tn10.

Base pairs 3838 - 4044 are a multiple cloning site from pBluescriptII sk(-)
corresponding to base pairs 924-718 of pBluescriptII sk(-).

Base pairs 4050 - 4938 are the Japanese quail ovalbumin promoter (including
SDRE, steroid-dependent response element). The Japanese quail ovalbumin promoter
was isolated by its high degree of homology to the chicken ovalbumin promoter
(GenBank accession number J00895 M24999, base pairs 431-1332). Some deletions
were noted in the quail sequence, as compared to the chicken sequence.

Base pairs 4945 - 6092 are a quail ovalbumin signal sequence and ovalbumin
gene that corresponds to base pairs 54 - 1201 of GenBank accession number
X53964.1 (the STOP codon being omitted).

Base pairs 6093 - 6246 are a TAG sequence containing a gp41 hairpin loop
from HIV I, an enterokinase cleavage site and a spacer (synthetic).

Base pairs 6247 - 6507 are a proinsulin gene.

Base pairs 6514 - 6866 are a synthetic polyadenylation sequence from pGWiz
(Gene Therapy Systems) corresponding to base pairs 1920 - 2272 of pGWiz.

Base pairs 6867 - 7303 are a multiple cloning site from pBluescriptII sk(-)
corresponding to base pairs 667-235 of pBluescriptII sk(-).

Base pairs 7304 - 7379 are the 70 base pairs of the right insertion sequence
(IS10) recognized by the transposon Tn10.

Base pairs 7380 - 7421 are λ DNA that is residual from pNK2859.

Base pairs 7422 - 8286 are non coding DNA that is residual from pNK2859.
Base pairs 8287 - 10487 are pBlueScript sk(-) base vector (Stratagene, Inc.) 
30 corresponding to base pairs 76 1 -296 1 of pBIuescriptll sk(-). 
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It should be noted that all non-coding DNA sequences described above can be 
replaced with any other 8Km-coding DNA sequence(s). Missing nucleotide sequences 
in the above construct represent restriction site remnants. 
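The statement that missing nucleotides represent restriction site remnants can be checked mechanically against a feature listing. The following is a minimal illustrative sketch, not part of the disclosed method: it takes the first six annotated features of the Example 13 construct (coordinates copied from the listing, 1-based and inclusive) and reports every unannotated stretch between consecutive features. The names `FEATURES` and `find_gaps` are hypothetical.

```python
# Annotated features as (start, end, label), 1-based inclusive coordinates
# copied from the first entries of the Example 13 listing.
FEATURES = [
    (1, 130, "F1(-) ori remainder"),
    (133, 1777, "CMV promoter/enhancer"),
    (1780, 2987, "transposase"),
    (2988, 2993, "engineered stop codon"),
    (2995, 3410, "synthetic polyA"),
    (3415, 3718, "non-coding DNA from pNK2859"),
]


def find_gaps(features):
    """Return (first_bp, last_bp, length) for each unannotated stretch."""
    gaps = []
    ordered = sorted(features)
    # Compare each feature with the one that follows it.
    for (_, prev_end, _), (next_start, _, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1, next_start - prev_end - 1))
    return gaps


gaps = find_gaps(FEATURES)
for first, last, length in gaps:
    print(f"unannotated bp {first}-{last} ({length} bp)")
```

Short gaps of a few base pairs, as found here, are what one would expect for leftover restriction site fragments; a long gap would instead suggest a transcription error in the listing.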

EXAMPLE 14

Transfection of Immature Leghorn Roosters using a Transposon-based Vector containing a Proinsulin Gene via Testicular Injections

Vectors containing the elements Oval promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A (SEQ ID NO:31) and CMV promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A (SEQ ID NO:42) were each injected into the testes of 11 week old white leghorn roosters. These birds were held under normal conditions until sexual maturity was reached.

At the time of sexual maturity, each bird was handled and manipulated to obtain sperm. Sperm samples were collected in Hank's Buffered Salt Solution (HBSS) and stored at either -20° C or 4° C until needed. DNA was extracted from sperm using a MoBio Ultra Clean DNA Bloodspin Kit (MoBio Laboratories, Solana Beach CA). Fifty microliters of sperm was used in the DNA extraction protocol and the purified genomic DNA eluted in 100 µl of water. In each PCR reaction, approximately 0.5 - 0.75 µg of genomic DNA was used with primers anchored in the entag-1 (5') and the synthetic polyA-2, which amplify a 685 bp fragment. Five of nine birds gave positive reactions for the presence of the appropriate vector construct. These birds were then mated with normal females.

Birds that did not yield positive results with PCR on the sperm were sacrificed, their testes removed, and DNA extracted using an approximately 25 mg piece of tissue in a Qiagen DNEasy Tissue Kit; purified DNA was eluted in 200 µl water and PCR conducted as described above. Two of these birds gave a very strong, positive PCR reaction.

EXAMPLE 15

Transfection of Japanese Quail using a Transposon-based Vector containing a Proinsulin Gene via Oviduct Injections

Two experiments were conducted in Japanese quail using transposon-based vectors containing either Oval promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A (SEQ ID NO:31) or CMV promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A (SEQ ID NO:42).

In the first experiment, the Oval promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A containing construct was injected into the oviduct of sexually mature quail; three hens received 5 µg at a 1:3 Superfect ratio and three received 10 µg at a 1:3 Superfect ratio. As of the writing of the present application, at least one bird that received 10 µg of DNA was producing human proinsulin in egg white (other birds remain to be tested). This experiment indicates that 1) the DNA has been stable for at least 3 months; 2) protein levels are comparable to those observed with a constitutive promoter such as the CMV promoter; and 3) sexually mature birds can be injected and results obtained without the need for cell culture.

In the second experiment, the transposon-based vector containing CMV promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A was injected into the oviduct of sexually immature Japanese quail. A total of 9 birds were injected. Of the 8 survivors, 3 produced human proinsulin in the white of their eggs for over 6 weeks. An ELISA assay described in detail below was developed to detect GP41 in the fusion peptide (Oval gene/GP41 Enterokinase TAG/Proinsulin) since the GP41 peptide sequence is unique and not found as part of normal egg white protein. In all ELISA assays, the same birds produced positive results and all controls worked as expected.

ELISA Procedure: Individual egg white samples were diluted in sodium carbonate buffer, pH 9.6, and added to individual wells of 96 well microtiter ELISA plates at a total volume of 0.1 ml. These plates were then allowed to coat overnight at 4°C. Prior to ELISA development, the plates were allowed to warm to room temperature. Upon decanting the coating solutions and blotting away any excess, non-specific binding of antibodies was blocked by adding a solution of phosphate buffered saline (PBS), 1% (w/v) BSA, and 0.05% (v/v) Tween 20 and allowing it to incubate with shaking for a minimum of 45 minutes. This blocking solution was subsequently decanted and replaced with a solution of the primary antibody (Goat Anti-GP41 TAG) diluted in fresh PBS/BSA/Tween 20. After a two hour period of incubation with the primary antibody, each plate was washed with a solution of PBS and 0.05% Tween 20 in an automated plate washer to remove unbound antibody. Next, the secondary antibody, Rabbit anti-Goat Alkaline Phosphatase-conjugated, was diluted in PBS/BSA/Tween 20 and allowed to incubate 1 hour. The plates were then subjected to a second wash with PBS/Tween 20. Antigen was detected using a solution of p-Nitrophenyl Phosphate in Diethanolamine Substrate Buffer for Alkaline Phosphatase and measuring the absorbance at 30 minutes and 1 hour.
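The timed steps of the ELISA procedure can be laid out as a simple schedule. The sketch below is illustrative only and rests on stated assumptions: it uses the incubation times given in the text (blocking at its stated minimum of 45 minutes), runs the steps back to back, and ignores wash and handling time, which the text does not specify. The names `STEPS` and `schedule` are hypothetical.

```python
# Timed incubations in minutes, taken from the procedure above.
# Assumption: steps run back to back; wash and handling time is not counted.
STEPS = [
    ("block with PBS/BSA/Tween 20 (stated minimum)", 45),
    ("primary antibody, goat anti-GP41 TAG", 120),
    ("secondary antibody, rabbit anti-goat AP conjugate", 60),
    ("substrate development up to the 1 hour reading", 60),
]


def schedule(steps):
    """Return (label, start_minute, end_minute) for each consecutive step."""
    timeline = []
    clock = 0
    for label, minutes in steps:
        timeline.append((label, clock, clock + minutes))
        clock += minutes
    return timeline


timeline = schedule(STEPS)
total_minutes = timeline[-1][2]
for label, start, end in timeline:
    print(f"{start:>3}-{end:<3} min  {label}")
print(f"minimum incubation time after overnight coating: {total_minutes} min")
```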

EXAMPLE 16

Optimization of Intra-oviduct and Intra-ovarian Arterial Injections

Overall transfection rates of oviduct cells in a flock of chicken or quail hens are enhanced by synchronizing the development of the oviduct and ovary within the flock. When the development of the oviducts and ovaries are uniform across a group of hens and when the stage of oviduct and ovarian development can be determined or predicted, timing of injections is optimized to transfect the greatest number of cells. Accordingly, oviduct development is synchronized as described below to ensure that a large and uniform proportion of oviduct secretory cells are transfected with the gene of interest.

Hens are treated with estradiol to stimulate oviduct maturation as described in Oka and Schimke (T. Oka and RT Schimke, J. Cell Biol., 41, 816 (1969)), Palmiter, Christensen and Schimke (J. Biol. Chem. 245(4):833-845, 1970). Specifically, repeated daily injections of 1 mg estradiol benzoate are performed sometime before the onset of sexual maturation, a period ranging from 1-14 weeks of age. After a stimulation period sufficient to maximize development of the oviduct, hormone treatment is withdrawn thereby causing regression in oviduct secretory cell size but not cell number. At an optimum time after hormone withdrawal, the oviducts of treated hens are injected with the transposon-based vector. Hens are subjected to additional estrogen stimulation after an optimized time during which the transposon-based vector is taken up into oviduct secretory cells. Re-stimulation by estrogen activates the transposon mechanism of the transposon-based vector, causing the integration of the gene of interest into the host genome. Estrogen stimulation is then withdrawn and hens continue normal sexual development. If a developmentally regulated promoter such as the ovalbumin promoter is used, expression of the transposon-based vector initiates in the oviduct at the time of sexual maturation. Intra-ovarian artery injection during this window allows for high and uniform transfection efficiencies of ovarian follicles to produce germ-line transfections and possibly oviduct expression.

Other means are also used to synchronize the development, or regression, of the oviduct and ovary to allow high and uniform transfection efficiencies. Alterations of lighting and/or feed regimens, for example, cause hens to 'molt', during which time the oviduct and ovary regress. Molting is used to synchronize hens for transfection, and may be used in conjunction with other hormonal methods to control regression and/or development of the oviduct and ovary.



EXAMPLE 17

Isolation of Human Proinsulin Using Anti-TAG Column Chromatography

A HiTrap NHS-activated 1 ml column (Amersham) was charged with a 30 amino acid peptide that contained the gp-41 epitope containing gp-41's native disulfide bond that stabilizes the formation of the gp-41 hairpin loop. The 30 amino acid gp41 peptide is provided as SEQ ID NO:23. Approximately 10 mg of the peptide was dissolved in coupling buffer (0.2 M NaHCO3, 0.5 M NaCl, pH 8.3) and the ligand was circulated on the column for 2 hours at room temperature at 0.5 mL/minute. Excess active groups were then deactivated using 6 column volumes of 0.5 M ethanolamine, 0.5 M NaCl, pH 8.3 and the column was washed alternately with 6 column volumes of acetate buffer (0.1 M acetate, 0.5 M NaCl, pH 4.0) and ethanolamine (above). The column was neutralized using 1 X PBS. The column was then washed with buffers to be used in affinity purification: 75 mM Tris, pH 8.0 and elution buffer, 100 mM glycine-HCl, 0.5 M NaCl, pH 2.7. Finally, the column was equilibrated in 75 mM Tris buffer, pH 8.0.

Antibodies to gp-41 were raised in goats by inoculation with the gp-41 peptide described above. More specifically, goats were inoculated, given a booster injection of the gp-41 peptide and then bled. Serum was harvested by centrifugation. Approximately 30 ml of goat serum was filtered to 0.45 µm and passed over a TAG column at a rate of 0.5 mL/min. The column was washed with 75 mM Tris, pH 8.0 until absorbance at 280 nm reached a baseline. Three column volumes (3 ml) of elution buffer (100 mM glycine, 0.5 M NaCl, pH 2.7) was applied, followed by 75 mM Tris buffer, pH 8.0, all at a rate of 0.5 mL/min. One milliliter fractions were collected. Fractions were collected into 200 µL 1 M Tris, pH 9.0 to neutralize acidic fractions as rapidly as possible. A large peak eluted from the column, coincident with the application of the elution buffer. Fractions were pooled. Analysis by SDS-PAGE showed a high molecular weight species that separated into two fragments under reducing conditions, in keeping with the heavy and light chain structure of IgG.

Pooled antibody fractions were used to charge two 1 ml HiTrap NHS-activated columns, attached in series. Coupling was carried out in the same manner as that used for charging the TAG column.
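For planning purposes, the time each chromatography step takes follows directly from the volumes and the 0.5 mL/min flow rate stated above. The following is a small illustrative sketch, not part of the described procedure; `run_time_min` is a hypothetical helper, and only steps for which the text gives both a volume and a flow rate are computed.

```python
FLOW_ML_PER_MIN = 0.5    # flow rate stated for serum loading and elution
COLUMN_VOLUME_ML = 1.0   # 1 ml HiTrap NHS-activated column


def run_time_min(volume_ml, flow_ml_per_min=FLOW_ML_PER_MIN):
    """Minutes needed to pass volume_ml through the column at a constant flow."""
    if flow_ml_per_min <= 0:
        raise ValueError("flow rate must be positive")
    return volume_ml / flow_ml_per_min


# Approximately 30 ml of filtered goat serum loaded on the TAG column.
serum_load_min = run_time_min(30.0)
# Three column volumes (3 ml) of elution buffer.
elution_min = run_time_min(3 * COLUMN_VOLUME_ML)

print(f"serum loading: {serum_load_min:.0f} min")
print(f"elution:       {elution_min:.0f} min")
```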
feolati^^ 

Egg white from quail and chickens treated by intra-oviduct injection of the CMV-ovalbumin-TAG-proinsulin construct were pooled. Viscosity was lowered by subjecting the allantoid fluid to successively finer pore sizes using negative pressure filtration, finishing with a 0.22 µm pore size. Through the process, egg white was diluted approximately 1:16. The clarified sample was loaded on the Anti-TAG column and eluted in the same manner as described for the purification of the anti-TAG antibodies. A peak of absorbance at 280 nm, coincident with the application of the elution buffer, indicated that protein had been specifically eluted from the Anti-TAG column. Fractions containing the eluted peak were pooled for analysis.

The pooled fractions from the Anti-TAG affinity column were characterized by SDS-PAGE and western blot analysis. SDS-PAGE of the pooled fractions revealed a 60 kDal molecular weight band not present in control egg white fluid, consistent with the predicted molecular weight of the transgenic protein. Although some contaminating bands were observed, the 60 kDal species was greatly enriched compared to the other proteins. An aliquot of the pooled fractions was cleaved overnight at room temperature with the protease, enterokinase. SDS-PAGE analysis of the cleavage product revealed a band not present in the uncut material that co-migrated with a commercial human proinsulin positive control. Western blot analysis showed specific binding to the 60 kDal species under non-reducing conditions (which preserve the hairpin epitope of gp-41 by retaining the disulfide bond). Western analysis of the low molecular weight species that appeared upon cleavage with an anti-human proinsulin antibody conclusively identified the cleaved fragment as human proinsulin.

EXAMPLE 18

Construction of a Transposon-based Transgene for the Expression of a Monoclonal Antibody

Production of a monoclonal antibody using transposon-based transgenic methodology is accomplished in a variety of ways.

1) two vectors are constructed: one that encodes the light chain and a second vector that encodes the heavy chain of the monoclonal antibody. These vectors are then incorporated into the genome of the target animal by at least one of two methods: a) direct transfection of a single animal with both vectors (simultaneously or as separate events); or, b) a male and a female of the species carry in their germline one of the vectors and then they are mated to produce progeny that inherit a copy of each.

2) the light and heavy chains are included on a single DNA construct, either separated by insulators and expression is governed by the same (or different) promoters, or by using a single promoter governing expression of both transgenes with the inclusion of elements that permit separate transcription of both transgenes, such as an internal ribosome entry site.

The following example describes the production of a transposon-based DNA construct that contains both the coding region for a monoclonal light chain and a heavy chain on a single construct. Beginning with the vector pTnMod, the coding sequences for the heavy and light chains are added, each preceded by an appropriate promoter and signal sequence. Using methods known to one skilled in the art, approximately 1 Kb of the proximal elements of the ovalbumin promoter are linked to the signal sequence of ovalbumin or some other protein secreted from the target tissue. Two copies of the promoter and signal sequence are added to the multiple cloning site of pTnMod, leaving space and key restriction sites between them to allow the subsequent addition of the coding sequences of the light and heavy chains of the monoclonal antibody. Methods known to one skilled in the art allow the coding sequences of the light and heavy chains to be inserted in-frame for appropriate expression. For example, the coding sequence of light and heavy chains of a murine monoclonal antibody that show specificity for human seminoprotein have recently been disclosed (GenBank Accession numbers AY129006 and AY129304 for the light and heavy chains, respectively). The light chain cDNA sequence is provided in SEQ ID NO:34, whereas the cDNA of the heavy chain is reported as provided in SEQ ID NO:35.

Thus one skilled in the art can produce both the heavy and light chains of a monoclonal antibody in a single cell within a target tissue and species. If the modified cell contained normal posttranslational modification capabilities, the two chains would form their native configuration and disulfide attachments and be substrates for glycosylation. Upon secretion, then, the monoclonal antibody is accumulated, for example, in the egg white of a chicken egg, if the transgenes are expressed in the magnum of the oviduct.

It should also be noted that, although this example details production of a full-length murine monoclonal antibody, the method is quite capable of producing hybrid antibodies (e.g. a combination of human and murine sequences; "humanized" monoclonal antibodies), as well as useful antibody fragments, known to one skilled in the art, such as Fab, Fc, F(ab) and Fv fragments. This method can be used to produce molecules containing the specific areas thought to be the antigen recognition sequences of antibodies (complementarity determining regions), linked, modified or incorporated into other proteins as desired.

EXAMPLE 19

Treatment of rats with a transposon-based vector for tissue-specific insulin gene incorporation

Rats are made diabetic by administering the drug streptozotocin (Zanosar, Upjohn, Kalamazoo, MI) at approximately 200 mg/kg. The rats are bred and maintained according to standard procedures. A transposon-based vector containing a proinsulin gene, an appropriate carrier, and, optionally, a transfection agent, are injected into rats' hepatic (if using G6P) artery with the purpose of stable transformation. Incorporation of the insulin gene into the rat genome and levels of insulin expression are ascertained by a variety of methods known in the art. Blood and tissue samples from live or sacrificed animals are tested. A combination of PCR, Southern and Northern blots, in-situ hybridization and related nucleic acid analysis methods are used to determine incorporation of the vector-derived proinsulin DNA and levels of transcription of the corresponding mRNA in various organs and tissues of the rats. A combination of SDS-PAGE gels, Western Blot analysis, radioimmunoassay, and ELISA and other methods known to one of ordinary skill in the art are used to determine the presence of insulin and the amount produced. Additional transfections of the vector are used to increase protein expression if the initial amounts of the expressed insulin are not satisfactory, or if the level of expression tapers off. The physiological condition of the rats is closely examined post-transfection to register positive or any negative effects of the gene therapy. Animals are examined over extended periods of time post-transfection in order to monitor the stability of gene incorporation and protein expression.
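The streptozotocin dose above is given per kilogram of body weight, so the amount per animal scales with its weight. The sketch below only illustrates that arithmetic; the 250 g body weight is a hypothetical value chosen for the example and does not come from the text, and `streptozotocin_dose_mg` is a hypothetical helper.

```python
DOSE_MG_PER_KG = 200.0  # approximate dose stated in the text


def streptozotocin_dose_mg(body_weight_g, dose_mg_per_kg=DOSE_MG_PER_KG):
    """Return the dose in mg for an animal of the given body weight in grams."""
    if body_weight_g <= 0:
        raise ValueError("body weight must be positive")
    return dose_mg_per_kg * body_weight_g / 1000.0


# Hypothetical 250 g rat, used only to illustrate the calculation.
example_dose_mg = streptozotocin_dose_mg(250)
print(f"dose for a 250 g rat: {example_dose_mg:.0f} mg")
```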



EXAMPLE 20

Exemplary Transposon-Based Vectors

The following example provides a description of various transposon-based vectors of the present invention and several constructs for insertion into the transposon-based vectors of the present invention. These examples are not meant to be limiting in any way. The constructs for insertion into a transposon-based vector are provided in a cloning vector labeled pTnMCS.



pTnMCS (base vector)
Bp 1 - 130 Remainder of F1 (-) ori of pBluescriptII sk(-) (Stratagene) bp 1-130
Bp 133 - 1777 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems) bp 229-1873
Bp 1783 - 2991 Transposase, from Tn10 (GenBank accession #J01829) bp 108-1316
Bp 2992 - 3344 Non coding DNA from vector pNK2859
Bp 3345 - 3387 Lambda DNA from pNK2859
Bp 3388 - 3457 70 bp of IS10 left from Tn10
Bp 3464 - 3670 Multiple cloning site from pBluescriptII sk(-), thru the XmaI site bp 924-718
Bp 3671 - 3715 Multiple cloning site from pBluescriptII sk(-), from the XmaI site thru the XhoI site. These base pairs are usually lost when cloning into pTnMCS. bp 717-673
Bp 3716 - 4153 Multiple cloning site from pBluescriptII sk(-), from the XhoI site bp 672-235
Bp 4159 - 4228 70 bp of IS10 left from Tn10
Bp 4229 - 4270 Lambda DNA from pNK2859
Bp 4271 - 5114 Non-coding DNA from pNK2859
Bp 5115 - 7315 pBluescript sk (-) base vector (Stratagene, Inc.) bp 761-2961
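Each entry that cites a source range should cover the same number of base pairs in the vector as in the source sequence. The sketch below is an illustrative consistency check under that assumption, using the pTnMCS entries that give both ranges (coordinates as read from the listing, 1-based and inclusive). A descending source range is taken to mean the fragment is inserted in reverse orientation. The names `SEGMENTS`, `span` and `check` are hypothetical.

```python
# (vector_start, vector_end, source_start, source_end, label) for pTnMCS.
SEGMENTS = [
    (1, 130, 1, 130, "F1(-) ori remainder"),
    (133, 1777, 229, 1873, "CMV promoter/enhancer"),
    (1783, 2991, 108, 1316, "Tn10 transposase"),
    (3464, 3670, 924, 718, "MCS thru XmaI"),
    (3671, 3715, 717, 673, "MCS XmaI thru XhoI"),
    (3716, 4153, 672, 235, "MCS from XhoI"),
    (5115, 7315, 761, 2961, "pBluescript base vector"),
]


def span(a, b):
    """Length in bp of an inclusive range given in either direction."""
    return abs(b - a) + 1


def check(segments):
    """Return (label, vector_length, source_length, orientation) per segment."""
    report = []
    for v0, v1, s0, s1, label in segments:
        orientation = "reverse" if s1 < s0 else "forward"
        report.append((label, span(v0, v1), span(s0, s1), orientation))
    return report


report = check(SEGMENTS)
mismatches = [row for row in report if row[1] != row[2]]
for label, vec_len, src_len, orientation in report:
    flag = "ok" if vec_len == src_len else "MISMATCH"
    print(f"{label:<26} {vec_len:>5} bp vs {src_len:>5} bp  {orientation:<7} {flag}")
```

The same check can be applied to the derived constructs that follow; an entry whose two ranges differ in length points to a transcription error in one of the coordinates.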

pTnMCS (CMV-prepro-ent-hGH-CPA)
Bp 1 - 3670 from vector pTnMCS, bp 1 - 3670
Bp 3676 - 5320 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5326 - 5496 Capsite/Prepro taken from GenBank accession # X07404, bp 563 - 733
Bp 5504 - 5652 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 5653 - 6306 Human growth hormone taken from GenBank accession # V00519, bp 1-654
Bp 6313 - 6720 Conalbumin polyA taken from GenBank accession # Y00407, bp 10651-11058
Bp 6722 - 10321 from cloning vector pTnMCS, bp 3716-7315

Bp 1 - 3670 from vector pTnMCS, bp 1 - 3670
Bp 3676 - 5320 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5327 - 6480 Chicken ovalbumin gene taken from GenBank accession # V00383, bp 66-1219
Bp 6487 - 6636 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6637 - 6897 Human Proinsulin taken from GenBank accession # NM000207, bp 117-377
Bp 6898 - 6942 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
Bp 6943 - 7295 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
Bp 7296 - 10895 from cloning vector pTnMCS, bp 3716-7315




WO 2004/003157 



PCT/US2003/020389 



Bp 1 - 3670 from vector pTnMCS, bp 1 - 3670
Bp 3676 - 5320 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5326 - 5496 Capsite/Prepro taken from GenBank accession # X07404, bp 563 - 733
Bp 5504 - 5652 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 5653 - 5913 Human Proinsulin taken from GenBank accession # NM000207, bp 117-377
Bp 5914 - 5958 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
Bp 5959 - 6310 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
Bp 6313 - 9912 from cloning vector pTnMCS, bp 3716-7315




Bp 1 - 3670 from vector pTnMCS, bp 1 - 3670
Bp 3676 - 4350 Chicken Ovalbumin enhancer taken from GenBank accession # S82527.1 bp 1-675
Bp 4357 - 5692 Chicken Ovalbumin promoter taken from GenBank accession # J00895-M24999 bp 1-1336
Bp 5699 - 6917 Chicken Ovalbumin gene from GenBank Accession # V00383.1 bp 2-1220. (This sequence includes the 5'UTR, containing putative cap site, bp 5699-5762.)
Bp 6924 - 7073 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 7074 - 7334 Human proinsulin GenBank Accession # NM000207 bp 117-377
Bp 7335 - 7379 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
Bp 7380 - 7731 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920 - 2271
Bp 7733 - 11332 from vector pTnMCS, bp 3716 - 7315

Bp 1 - 3670 from cloning vector pTnMCS, bp 1 - 3670
Bp 3676 - 4350 Chicken Ovalbumin enhancer taken from GenBank accession # S82527.1 bp 1-675
Bp 4357 - 5692 Chicken Ovalbumin promoter taken from GenBank accession # J00895-M24999 bp 1-1336
Bp 5699 - 5869 Cecropin cap site and Prepro, Genbank accession # X07404 bp 563-733
Bp 5876 - 6025 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6026 - 6286 Human proinsulin GenBank Accession # NM000207 bp 117-377
Bp 6287 - 6331 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
Bp 6332 - 6683 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
Bp 6685 - 10284 from cloning vector pTnMCS, bp 3716 - 7315

pTnMCS (Quail OVep-OVg-ENT-proIns-syn polyA)
Bp 1 - 3670 from cloning vector pTnMCS, bp 1 - 3670
Bp 3676 - 4333 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-house from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession # S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases does not correspond exactly.)
Bp 4340 - 5705 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession # J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
Bp 5712 - 6910 Quail Ovalbumin gene, EMBL accession # X53964, bp 1-1199. (This sequence includes the 5'UTR, containing putative cap site bp 5712-5764.)
Bp 6917 - 7066 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 7067 - 7327 Human proinsulin GenBank Accession # NM000207 bp 117-377
Bp 7328 - 7372 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
Bp 7373 - 7724 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
Bp 7726 - 11325 from cloning vector pTnMCS, bp 3716 - 7315

pTnMCS (CHOVep-prepro-ent-hGH-CPA)
Bp 1 - 3670 from vector pTnMCS, bp 1-3670
Bp 3676 - 4350 Chicken Ovalbumin enhancer taken from GenBank accession # S82527.1, bp 1-675
Bp 4357 - 5692 Chicken Ovalbumin promoter taken from GenBank accession # J00899-M24999, bp 1-1336
Bp 5699 - 5869 Capsite/Prepro taken from GenBank accession # X07404, bp 563-733
Bp 5877 - 6025 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6026 - 6679 Human growth hormone taken from GenBank accession # V00519, bp 1-654
Bp 6686 - 7093 Conalbumin polyA taken from GenBank accession # Y00407, bp 10651-11058
Bp 7095 - 10694 from cloning vector pTnMCS, bp 3716-7315

Bp 1 - 3670 from cloning vector pTnMCS, bp 1 - 3670
Bp 3676 - 4333 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-house from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession # S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases does not correspond exactly.)
Bp 4340 - 5705 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession # J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
Bp 5712 - 5882 Cecropin cap site and Prepro, Genbank accession # X07404 bp 563-733
Bp 5889 - 6038 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6039 - 6299 Human proinsulin GenBank Accession # NM000207 bp 117-377
Bp 6300 - 6344 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
Bp 6345 - 6696 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920 - 2271
Bp 6698 - 10297 from cloning vector pTnMCS, bp 3716 - 7315



pTnMOD
Bp 1 - 130 remainder of F1 (-) ori of pBluescriptII sk(-) (Stratagene) bp 1-130
Bp 133 - 1777 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems) bp 229-1873
Bp 1783 - 2991 Transposase, modified from Tn10 (GenBank accession #J01829) bp 108-1316
Bp 2992 - 2994 Engineered stop codon
Bp 2996 - 3411 Synthetic polyA from gWIZ (Gene Therapy Systems) bp 1922 - 2337
Bp 3412 - 3719 Non-coding DNA from vector pNK2859
Bp 3720 - 3762 Lambda DNA from pNK2859
Bp 3763 - 3832 70 bp of IS10 left from Tn10
Bp 3839 - 4045 Multiple cloning site from pBluescriptII sk(-), thru the XmaI site bp 924-718
Bp 4046 - 4090 Multiple cloning site from pBluescriptII sk(-), from the XmaI site thru the XhoI site. These base pairs are usually lost when cloning into pTnMCS. bp 717-673
Bp 4091 - 4528 Multiple cloning site from pBluescriptII sk(-), from the XhoI site bp 672-235






Bp 4534 - 4603 70 bp of IS10 left from Tn10
Bp 4604 - 4645 Lambda DNA from pNK2859
Bp 4646 - 5489 Non-coding DNA from pNK2859
Bp 5490 - 7690 pBluescript sk (-) base vector (Stratagene, Inc.) bp 761-2961

Bp 1 - 4045 from vector pTnMCS, bp 1 - 4045
Bp 4051 - 4725 Chicken Ovalbumin enhancer taken from GenBank accession # S82527.1, bp 1 - 675
Bp 4732 - 6067 Chicken Ovalbumin promoter taken from GenBank accession # J00899-M24999, bp 1-1336
Bp 6074 - 6245 Capsite/Prepro taken from GenBank accession # X07404, bp 563 - 733
Bp 6252 - 6400 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6401 - 7054 Human growth hormone taken from GenBank accession # V00519, bp 1-654
Bp 7061 - 7468 Conalbumin polyA taken from GenBank accession # Y00407, bp 10651-11058
Bp 7470 - 11069 from cloning vector pTnMCS, bp 3716-7315

Bp 1 - 4045 from vector pTnMCS, bp 1 - 4045
Bp 4051 - 5695 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5702 - 6855 Chicken ovalbumin gene taken from GenBank accession # V00383, bp 66-1219
Bp 6862 - 7011 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 7012 - 7272 Human Proinsulin taken from GenBank accession # NM000207, bp 117-377
Bp 7273 - 7317 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
Bp 7318 - 7670 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
Bp 7672 - 11271 from cloning vector pTnMCS, bp 3716-7315

Bp 1 - 4045 from vector pTnMCS, bp 1 - 4045
Bp 4051 - 5693 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5701 - 5871 Capsite/Prepro taken from GenBank accession # X07404, bp 563 - 733
Bp 5879 - 6027 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6028 - 6681 Human growth hormone taken from GenBank accession # V00519, bp 1-654
Bp 6688 - 7095 Conalbumin polyA taken from GenBank accession # Y00407, bp 10651-11058
Bp 7097 - 10696 from cloning vector pTnMCS, bp 3716-7315

pTnMOD (CMV-prepro-ent-ProInsulin-synPA)
Bp 1 - 4045 from vector pTnMCS, bp 1 - 4045

Bp 4051 - 5695 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy
Systems), bp 230-1864

Bp 5701 - 5871 Capsite/Prepro taken from GenBank accession # X07404, bp 563 -
733

Bp 5879 - 6027 Synthetic spacer sequence and hairpin loop of HIV gp41 with an
added enterokinase cleavage site

Bp 6028 - 6288 Human Proinsulin taken from GenBank accession # NM000207, bp
117-377

Bp 6289 - 6333 Spacer DNA, derived as an artifact from the cloning vectors pTOPO
Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)

Bp 6334 - 6685 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy
Systems), bp 1920-2271

Bp 6687 - 10286 from cloning vector pTnMCS, bp 3716-7315

Bp 1 - 4045 from cloning vector pTnMOD, bp 1 - 4045

Bp 4051 - 4725 Chicken Ovalbumin enhancer taken from GenBank accession #
S82527.1 bp 1-675

Bp 4732 - 6067 Chicken Ovalbumin promoter taken from GenBank accession #
J00895-M24999 bp 1-1336

Bp 6074 - 7292 Chicken Ovalbumin gene from GenBank Accession # V00383.1 bp
2-1220. (This sequence includes the 5'UTR, containing putative cap site bp 6074-
6137.)

Bp 7299 - 7448 Synthetic spacer sequence and hairpin loop of HIV gp41 with an
added enterokinase cleavage site

Bp 7449 - 7709 Human proinsulin GenBank Accession # NM000207 bp 117-377

Bp 7710 - 7754 Spacer DNA, derived as an artifact from the cloning vectors pTOPO
Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)

Bp 7755 - 8106 Synthetic polyA from the cloning vector gWIZ (Gene Therapy
Systems) bp 1920 - 2271

Bp 8108 - 11707 from cloning vector pTnMCS, bp 3716 - 7315

pTnMOD (Chicken OVep+prepro+ENT+proins+syn polyA)
Bp 1 - 4045 from cloning vector pTnMCS, bp 1 - 4045

Bp 4051 - 4725 Chicken Ovalbumin enhancer taken from GenBank accession #
S82527.1 bp 1-675

Bp 4732 - 6067 Chicken Ovalbumin promoter taken from GenBank accession #
J00895-M24999 bp 1-1336

Bp 6074 - 6244 Cecropin cap site and Prepro, GenBank accession # X07404 bp 563-
733

Bp 6251 - 6400 Synthetic spacer sequence and hairpin loop of HIV gp41 with an
added enterokinase cleavage site

Bp 6401 - 6661 Human proinsulin GenBank Accession # NM000207 bp 117-377

Bp 6662 - 6706 Spacer DNA, derived as an artifact from the cloning vectors pTOPO
Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)

Bp 6707 - 7058 Synthetic polyA from the cloning vector gWIZ (Gene Therapy
Systems) bp 1920 - 2271

Bp 7060 - 10659 from cloning vector pTnMCS, bp 3716 - 7315
Bp 1 - 4045 from cloning vector pTnMCS, bp 1 - 4045

Bp 4051 - 4708 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-house
from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin
enhancer, GenBank accession # S82527.1, bp 1-675. (There are multiple base pair
substitutions and deletions in the quail sequence, relative to chicken, so the number of
bases does not correspond exactly.)

Bp 4715 - 6080 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house
from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter,
GenBank accession # J00895-M24999 bp 1-1336. (There are multiple base pair
substitutions and deletions between the quail and chicken sequences, so the number of
bases does not correspond exactly.)

Bp 6087 - 7285 Quail Ovalbumin gene, EMBL accession # X53964, bp 1-1199. (This
sequence includes the 5'UTR, containing putative cap site bp 6087-6139.)

Bp 7292 - 7441 Synthetic spacer sequence and hairpin loop of HIV gp41 with an
added enterokinase cleavage site

Bp 7442 - 7702 Human proinsulin GenBank Accession # NM000207 bp 117-377

Bp 7703 - 7747 Spacer DNA, derived as an artifact from the cloning vectors pTOPO
Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)

Bp 7748 - 8099 Synthetic polyA from the cloning vector gWIZ (Gene Therapy
Systems) bp 1920 - 2271

Bp 8101 - 11700 from cloning vector pTnMCS, bp 3716 - 7315

Bp 1 - 4045 from cloning vector pTnMCS, bp 1 - 4045

Bp 4051 - 4708 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-house
from quail genomic DNA, roughly equivalent to the far-upstream chicken
ovalbumin enhancer, GenBank accession # S82527.1, bp 1-675. (There are multiple
base pair substitutions and deletions in the quail sequence, relative to chicken, so the
number of bases does not correspond exactly.)

Bp 4715 - 6080 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house
from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter,
GenBank accession # J00895-M24999 bp 1-1336. (There are multiple base pair
substitutions and deletions between the quail and chicken sequences, so the number of
bases does not correspond exactly.)

Bp 6087 - 6257 Cecropin cap site and Prepro, GenBank accession # X07404 bp 563-
733

Bp 6264 - 6413 Synthetic spacer sequence and hairpin loop of HIV gp41 with an
added enterokinase cleavage site

Bp 6414 - 6674 Human proinsulin GenBank Accession # NM000207 bp 117-377

Bp 6675 - 6719 Spacer DNA, derived as an artifact from the cloning vectors pTOPO
Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)

Bp 6720 - 7071 Synthetic polyA from the cloning vector gWIZ (Gene Therapy
Systems) bp 1920 - 2271

Bp 7073 - 10672 from cloning vector pTnMCS, bp 3716 - 7315

BP 1-130 remainder of F1(-) ori of pBluescriptII sk(-) (Stratagene) bp 1-130.

BP 133-1777 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy
Systems) bp 229-1873.

BP 1780-2987 Transposase, modified from Tn10 (GenBank # J01829).
BP 2988-2990 Engineered stop codon.
BP 2991-3343 non-coding DNA from vector pNK2859.
BP 3344-3386 lambda DNA from pNK2859.
BP 3387-3456 70 bp of IS10 left from Tn10.

BP 3457-3674 multiple cloning site from pBluescriptII sk(-) bp 924-707.
BP 3675-5691 Chicken Ovalbumin enhancer plus promoter from a Topo Clone 10
maxi 040303 (5' XmaI, 3' BamHI)

BP 5698-5865 prepro with Cap site amplified from cecropin of pMON200
GenBank # X07404 (5' BamHI, 3' KpnI)

BP 5872-7338 Protein A gene from GenBank # J01786, mature peptide bp 292-1755
(5' KpnI, 3' SacII)

BP 7345-7752 ConPolyA from chicken conalbumin polyA from GenBank # Y00407
bp 10651-11058 (5' SacII, 3' XhoI)
BP 7753-8195 multiple cloning site from pBluescriptII sk(-) bp 677-235.
BP 8196-8265 70 bp of IS10 left from Tn10.
BP 8266-8307 Lambda DNA from pNK2859
BP 8308-9151 noncoding DNA from pNK2859

BP 9152-11352 pBluescriptII sk(-) base vector (Stratagene, Inc.) bp 761-2961
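Each single-line entry in the feature maps above pairs a coordinate range in the vector with a coordinate range in the source sequence, so the two lengths can be compared as a consistency check. The following is a minimal sketch of such a check, not part of the original disclosure; the function name `check_entry` and the regular expression are illustrative assumptions, and the sketch only handles entries written on one line that end in a `bp start-end` source range.

```python
import re

# Hypothetical helper, not from the source document.
# Matches "Bp <start> - <end> <description> bp <src_start>-<src_end>".
ENTRY = re.compile(
    r"^B[Pp]\s*(\d+)\s*-\s*(\d+)\s+(.*?)\bbp\s*(\d+)\s*-\s*(\d+)\.?\s*$"
)

def check_entry(line):
    """Return (vector_len, source_len) for one feature-map entry, or None.

    None means the line carries no source coordinate range
    (for example "70 bp of IS10 left from Tn10").
    """
    m = ENTRY.match(line.strip())
    if m is None:
        return None
    v_start, v_end, _desc, s_start, s_end = m.groups()
    vector_len = int(v_end) - int(v_start) + 1
    # Source ranges may be listed in reverse (e.g. bp 677-235).
    source_len = abs(int(s_end) - int(s_start)) + 1
    return vector_len, source_len

if __name__ == "__main__":
    for entry in (
        "BP 9152-11352 pBluescriptII sk(-) base vector bp 761-2961",
        "Bp 7470 - 11069 from cloning vector pTnMCS, bp 3716-7315",
        "Bp 4534 - 4603 70 bp of IS10 left from Tn10",
    ):
        print(entry, "->", check_entry(entry))
```

A mismatch between the two lengths does not by itself identify which coordinate is wrong; it only flags an entry worth re-reading against the original document.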


All patents, publications and abstracts cited above are incorporated herein by
reference in their entirety. It should be understood that the foregoing relates only to
preferred embodiments of the present invention and that numerous modifications or
alterations may be made therein without departing from the spirit and the scope of the
present invention as defined in the following claims.

1. A vector comprising:

a) a modified transposase gene operably-linked to a first
promoter; wherein the nucleic acid sequence 3' to the first promoter comprises
the sequence as set forth in SEQ ID NO: 13, wherein SEQ ID NO: 13 contains
the Kozak sequence and a start codon for the transposase, and wherein at least
one of the first twenty codons for the transposase gene are modified from the
wild-type sequence by changing a nucleotide at a third base position of the
codon to an adenine or thymine without modifying the amino acid encoded by
the codon; and

b) one or more genes of interest operably-linked to one or more
additional promoters; and wherein the one or more genes of interest and their
operably-linked promoters are flanked by transposase insertion sequences
recognized by the transposase encoded by the modified transposase gene.



2. The vector of claim 1, wherein the modified transposase gene
comprises an adenine or thymine at the third position in each of codons 2-10
of the modified transposase gene.

3. The vector of claim 1 or 2, comprising the sequence set forth in SEQ
ID NO: 1.

4. The vector of any one of claims 1 to 3, wherein the transposase is a
Tn10 transposase.

5. The vector of any one of claims 1 to 4, wherein one gene of interest is
operably-linked to a second promoter.

6. The vector according to any one of claims 1 to 5, wherein the first
promoter and the second promoter are independently selected from the group
consisting of a constitutive promoter and an inducible promoter.

7. The vector of claim 6, wherein the inducible promoter is selected from
the group consisting of an ovalbumin promoter, a conalbumin promoter, a
vitellogenin promoter or an ovomucoid promoter.

8. The vector according to any preceding claim, further comprising a
polyA sequence operably-linked to the transposase gene.

9. The vector of claim 8, wherein the polyA sequence is a conalbumin
polyA sequence.

10. The vector according to any preceding claim, further comprising two
stop codons operably-linked to the transposase gene.

11. The vector of any one of claims 1 to 4, wherein a first gene of interest
is operably-linked to a second promoter and a second gene of interest is
operably-linked to a third promoter.

12. The vector of any one of claims 1 to 4, wherein a first and a second
gene of interest are operably-linked to a second promoter.

13. The vector according to any preceding claim, further comprising an
enhancer operably-linked to the one or more genes of interest.

14. The vector of claim 13, wherein the enhancer comprises at least a
portion of an ovalbumin enhancer.

15. The vector according to any preceding claim, further comprising an
egg directing sequence operably-linked to the one or more genes of interest.

16. The vector of claim 15, wherein the egg directing sequence is an
ovalbumin signal sequence, an ovomucoid signal sequence or a vitellogenin
targeting sequence.

17. A method of producing a transgenic animal comprising administering
to the animal a vector according to any one of claims 1 to 16.

18. The method of claim 17, wherein the vector is administered via an
intratesticular, intraarterial, intraoviductal or intraembryonic route.

19. The method of claim 17 or 18, wherein the animal is an avian animal.



20. The method of claim 19, wherein the avian animal is a chicken or a 
quail. 

21. An egg produced by the transgenic avian animal of claim 19 or 20,
wherein the egg contains one or more desired proteins encoded by the one or
more genes of interest.

22. A transgenic sperm produced by the transgenic animal produced
according to any one of claims 17 to 20.

23. A method for producing a desired protein comprising: 

a) administering to the animal a vector according to any one of 
claims 1 to 16; and 

b) isolating the desired protein produced in the animal. 

24. The method of claim 23, wherein the animal is an egg-laying animal,
and the method of administration is intraoviductal, such that the desired
protein produced by the at least one gene of interest is isolated from the egg
white of eggs laid by the egg-laying animal.

25. The method of claim 23 or 24, wherein the vector further comprises a
TAG sequence and wherein the desired protein is purified using the TAG
sequence.

26. The method of claim 25, wherein the TAG sequence comprises: (i) a
sequence that encodes a polypeptide that functions as a purification handle;
(ii) a cleavage site; and (iii) a polynucleotide spacer.

27. The method of claim 25 or 26, wherein the TAG sequence comprises a
polynucleotide sequence shown in SEQ ID NO: 22.

28. The method of any one of claims 23 to 25, wherein the desired protein 
is a lytic protein, proinsulin, or a human growth hormone. 



29. The method according to any one of claims 23 to 28, wherein the
vector further comprises a second gene of interest operably-linked to a third
promoter and wherein the genes of interest encode antibody polypeptides.

30. The vector of claim 7, wherein the inducible promoter comprises the
sequence as set forth in SEQ ID NO: 17, SEQ ID NO: 40, or nucleic acids
4050-4938 of SEQ ID NO: 30.

31. The vector of claim 15 or 16, wherein the egg directing sequence
comprises at least one of the sequences as set forth in SEQ ID NO: 18, nucleic
acids 4960-5112 of SEQ ID NO: 3, nucleic acids 4943-5092 of SEQ ID NO:
4, nucleic acids 4958-6115 of SEQ ID NO: 29, or nucleic acids 4945-6092 of
SEQ ID NO: 30.



32. The vector of claim 8, wherein the polyA sequence comprises at least
one of the sequences as set forth in SEQ ID NO: 28, SEQ ID NO: 33, or
nucleic acids 2995-3410 of SEQ ID NO: 1.

33. The vector according to any one of claims 1 to 16, wherein the 
modified transposase gene comprises an A or a T at the third position in each 
of codons 2-10 of the modified transposase gene. 

34. The method of claim 23, wherein the vector is administered via an
intratesticular, intraarterial, intraperitoneal, intravenous, intraoviductal,
intraembryonic, nasal, or pronuclear route.
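Claims 1, 2 and 33 describe changing the third base of early transposase codons to adenine or thymine without changing the encoded amino acid. The sketch below illustrates that kind of synonymous substitution using the standard genetic code. It is an illustration under stated assumptions, not the procedure used to build the claimed vectors: the function names are hypothetical, the preference for A over T when both are synonymous is an arbitrary choice, and the example input is a made-up sequence rather than the transposase gene.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def at_third_base(codon):
    """Return a synonymous codon ending in A or T, or the codon unchanged."""
    codon = codon.upper()
    if codon[2] in "AT":
        return codon
    for base in "AT":  # assumption: try A first, then T
        candidate = codon[:2] + base
        if CODON_TABLE[candidate] == CODON_TABLE[codon]:
            return candidate
    # ATG (Met) and TGG (Trp) have no synonym ending in A or T.
    return codon

def modify_codons(orf, first=2, last=10):
    """Apply at_third_base to codons first..last (1-based) of an in-frame ORF."""
    n = len(orf) // 3
    codons = [orf[i * 3:i * 3 + 3] for i in range(n)]
    for i in range(first - 1, min(last, n)):
        codons[i] = at_third_base(codons[i])
    return "".join(codons) + orf[n * 3:]  # keep any trailing partial codon

if __name__ == "__main__":
    example = "ATGCTGGCCAGC"  # hypothetical ORF: Met-Leu-Ala-Ser
    print(modify_codons(example))
```

Codons that encode methionine or tryptophan are left unchanged because no synonymous codon for them ends in A or T.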
Appendix A

SEQ ID NO:1 (pTnMod)

5 CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG SO 
CGCAGCGTGA CCGCTACACT TGCC&GCGCC CTAGCGCCCG CTCCTTTCGC 100 
TTTCTTCCCT TCCTTTCTCG GCACGTTCGC CGGCATCAGA TTGGCTATTG 150 
GCCAT7GCAT ACGTTGTATC C ATAT C AT AA TATGTACATT TATATTGGCT 2 CO 
CATGTCCAAC ATTACCGCCA TCTTGACATT GATT&TTGAG TAGTTATTAA 250 

10 TAGTAATCAA TTACGGGGTG ATTAGTTCAT AGCCCATATA TGGAGTTCCG 3 00 
CGTTACATAA CTTACGGTAA ATGGGCGGCC TGGCTGACCG CCCAACGACC 35 0 
CCCGCCCATT GACGTCAATA ATGAC'GTATG TTCCCATAGT AACGGCAATA 400 
GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 450 
CTTGGCAGTA CATCARGTGT &TCAT ATG CC AAGT A CG CCC CCTATTGACG 500 

15 TCAATGACGG TAAATGGCCC GCCTGGC ATT ATGCCCAGTA CATGACCTTA 55 0 
TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 6 00 
CATGGTGATG CGGTTTTGGG AGTACATGAA TC3GGCGXGGA TAG CG GT'TTG £50 
ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 700 
TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTGGTA ACA^CTCCGC 75 0 

20 CCOATTGACG CAAATGGGCG -GTAGGCGTGT ACGGTGGGAG GTCTATATAA 90 0 
GCAGAGCTCG TTTAGTGAAG CGTCAGATCG CCTGGAGACG CCATCCACGC 850 
TGTTTTGACC TCCATAGAAG AC AC CGGGAC CGATCCAGCC TCCGCGGCOG 300 
GGAACGGTGC ATT GGAACGC GOAT TCCCCG TGCCAAGAGT GACGTAAGTA 350 
CCGCCTATAG ACT CTAT AG G CACACCCCTT TGGCTCTTAT GCATGCTATA 10 0 0 

25 CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTTCCTTAT GCTATAGGTG 105 0 
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100 
CCCCTATTGG T GACGAT ACT XT CC ATT ACT AATCCATAAC AT GGCT CTTT 115 0 
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 12 00 
TGACACGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 125 0 

30 AATTC A CAT A TACAACAACG: CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300 
CATAGCGTGG GATCTCCAGG GGAATCTCGG GTACGTGTTC CGGACATGGG 1350 
CTCTTCTCCG GTAGCGGCGG AGCTTCCAGA TCCGAGCGCT GGTCCCATGC 14 0 0 
CTCCAGCGGC TCATGGTCG C TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 1450 
CCAGACTTAG GCACAGCACA ATGCGCACGA CCACCAGTGT GCCGCACAAG 1500 

35 GCCGTGGCGG TAGGGTATGT GTGTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550 
CACGGCTGAC GCAGATGGAA GACTTAAGGC AGGGGCAGAA GAAGATGCAG 1600 
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650 
GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC HQ0 
CGCGCGCGCC ACC AG AC AT A ATAGC7GACA GACTAACAGA CTGTTCCTTT I7S0 

40 CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT IS 00 
TTACATGATT CTCTTTACGA ATTCXGCCCC GAATTACACT TAAAACGACT 185 0 
CAACAGCTTA ACGTTGGCTT GCCACGCATT ACTTGACTGT AAAAGTCTCA 1900 
CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAA G€G AG AACAAAAC AT 1950 
AACATCAAAC GAATCGACGG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2 000 

45 GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050 
GATGCCCATT GTACTTGTTG ACTOGTCTGA TATTCGTGAG GAAAAACGAC 2100 
TTATGGTATT GCGAGCTTCA GTCGCACTAC ACGGTCGTTC TGTTACTCTT 215 0 
TATGAGAAAG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 2200 
CCAATTTCTA GCCGACCTTG CGAGO&TTCT ACCGAGTAAC ACCACACCGC 2250 

50 TCATTGTCAG TG ATG CTG O C TTTAAAGTGC CATGGTATAA AT CCGTTG AG 23 00 
AAGCTGGGTT GGTACTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2350 
AGACCTAGGA GCGGAAAACT GG AAAC CT AT CAGCAACTTA CATGATATGT 2400 
CAT C TACTC A CTGAAAGAC? TTAGC-CTAXA AGAGGCTGAC TAAAAGCAAT 2450 
CCAATCTCAT G CC AAATT CT ATTGTATAAA TCTCGCTCTA AAGGCCGAAA 2S00 

55 AAATCAGCGC TCGAGACGGA GTCATTGTCA CCACCCGTCA CCTAAAATCT 2550 
ACTCAGCGTC GGCAAAGGAG CCATGGGTTC TAGCAAGTA& CTTACCTGXT 2600 
G A A ATT CG A A CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2650 
GATI'GAAGAA ACCTTCCGAG ACTTGAAAAG TCCTGCCTAC GG ACT AGG CC 2700 
T AC G C CAT AG CCGAACGAGC AGGTCAGAGC GTTTTGATAT CATGCTGCTA 27 50 

60 ATCGCCCTGA TGCTTCAACT AACATGTTGG CTTGGGGGCG TTCATGCTCA 2300 
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATCGAA 2850
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2900
TACACAATAA CAAGGGAAGA CTTACTCGTG GCTGCAACCC TACTAGCTCA 2 950 
AAATTTATTC ACAC&TGGTT ACGCTTTGGG GAA&TTATGA TAATGATCCA 3 0 GO 
GATCACTCCT GGCTAATAAA AGATCAGAGC TCTAGAOATC TGTGTGTTGG 3 050 
5 TTTTTTGTGG ATCTGCTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 3100 
CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT 3150 
TTCGTAATAA AATGAGGAA& TTGC&TCGC& TTGTCTGAGT AGGTGTCATT 32 00 
CTATTCTGGG GGGTGGGGTG GGGCAGCACA GCAAG GGGGA GGATTGGGAA 3250 
GACAATAGCA GGGATGCTGG GGATGCGGTG GGCTCTATGG GTACCTCTCT 3 3 00 

10 CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCGGTAC CTCTCTCTCT 3 3 50 
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 34 00 
TTGACCCGGT GACCA&AGGT GCCTTTTATC ATCACTTTAA AAATAAAAAA 34 50 
CAATTACTCA GTGCCTGTTA TAAGGAGCAA TTAATTATGA TTGATGCCTA 3500 
CATCACAACA AAAACTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550 

15 TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3 600 
CCTATCCAAG AAGTGATQCC TATCATTGGT TGGAATGAAC TPGAAAAAAA 3 650 
TTAGCCTTGA ATACATTACT GGTAAGGTA& ACGCCATTGT GAGCAAATXG 3 7 00 
ATCCAAGAG& ACCAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3 7 50 
G AC CCTGAGC ACTGATGAAT CCCCTAATGA TTTTGGTAAA AATCATTAAG 3 8 00 

20 TTAAGGTGGA TACACATCTT GTCATATGAT CCCGGTAATG TGAGTTAGCT 3 850 
C ACT C ATT AG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT 3 900 
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 3 950 
CATGATTA CG CCAAGCGCGC AATTAACCCT CACTAAAGGG AACAAAAGCT 4 0 00 
GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGC 4050 

25 TGCAGGAAXT CGATATCAAG CTTATCGATA CCGCTGACCT CGAGGGGGGG 4100 
CCCGGTACCC AATTCGCCCT ATAGTGAGTC GTATTACGCG CGCTCACTGG 4150 
CCGTCGTTTT ACAACGTCGT GACTGGGAAA ACCCTGGCGT TACCCAACTT 4 2 00 
AATCGCCTTG CAGCACATCC CCCTTTCGCC AGCTGGCGTA ATAGCGAAGA 4 250 
GGCCCGCACC GATCGCCCTT CCCAACAGTT GCGCAGCCTG AATGGCGAAT 4 3 00 

30 GGAAATTGTA AGCGTTAATA TTTTGTTAAA ATTCGCGTTA AATTTTTGTT 4 350 
AAATCAGCTC ATTTTTTAAC CAATAGGCCG AAATCGGCAA AATCCCTTAT 4400 
AAATCAAAAG A&TAOACCGA GATAGGGTTG AGTGTTGTTC CAGTTTGGAA 4 450 
CAAGAGTCCA CTATTAAAGA ACGTGGACTC CAACGTCAAA GGGCGAAAAA 4 500 
CCGTCTATCA GGGCGATGGC CCACTACTCC GGGAT CAT AT GACAAGATGT 4 550 

35 GTATCCACCT TAACTTA&TG ATTTTTACCA AAAT CATTAG GGGATTCATC 4 6 00 
AGTGCTCAGG GTCAACGAGA ATTAACATTC CGTCAGGAAA GCTTATGATG 4 6 SO 
ATGATGTGCT TAAAAACTTA CTCAATGGCT GGTTATGCAT ATCGCAATAC 4 7 00 
ATGCGAAAAA CCTAAAAGAG CTTGCCGATA A AAAAGGC C A ATTTATTGCT 4 750 
ATTTACCGCG GCTTTTTATT GAGCTTG&AA GATAAATAAA ATAGATAGGT 4 8 00 

40 TTTATTTGAA GCTAAATCTT CTTTATCGTA AAAAATGCCC TCTTGGGTTA 4 8 SO 
TCAAGAGGGT CATTATATTT CGCGGAATAA CATCATTTGG TGACGAAATA 4 200 
ACTAAG C ACT TGTCTCCTGT TTACTCCCCT GAG CTTGAGG GGTTAACATG 4 950 
AAGGTCATCG ATAGCAGGAT AATAATACAG TAAAACGCTA AACCAATAAT 5 000 
CCAAATCCAG CCATCCCAAA TTGGTAGTGA ATGATTATAA ATAACAGCAA 5 050 

45 ACAGTAATGG GCCAATAACA CCGGTTG CAT TGGTAAGGCT CACCAATAAT 5100 
CCCTGTAAAG GACCTTGCTG ATGACTCTTT GTTTGGATAG ACATCACTCC 515 0 
CTGTAATGCA GGTAAAGCGA TCCCACCACC AGCCAAT AAA ATTAAAACAG 52 00 
GGAAAACTAA CCAACCTTCA GAT AT AAA CG CTAAAAAGGC AAATG C ACT A 5250 
CTATCTGCAA TAAATCCGAG CAGTACTGCC GTTTTTTCGC CCATTTAGTG 53 00 

50 GCTATTCTTC GTGCCACAAA GGCTTGGAAT ACTGAGTGTA AAAGACCAAG 53 5 0 
AC CCGTAATG AAAAGCCAAC CAT CATG CT A TTCATCATCA CGATTTCTGT 54 00 
AATAGCACCA CACCGTGCTG GATTGGCTAT GAATGCGCTG AAATAATAAT 54 50 
CAACAAATGG CATCGTT&AA TAAGTGATGT ATACCGATCA GCTTTTGTTC 5500 
CCTTTAGTGA GGGTTAATTG CGCGCTTGGC GTAATCATGG TCATAGCTST 555 0 

55 TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAA CATACGAGCC 5 GOO 
GGAAGCATAA AGTGTAAAGC CTGGGGTGCC TAATGAGTGA GCTAACTCAC 5650 
ATTAATTGCG TTGCGCTCAC TGCCCGCTTT CCAGTCGGGA AACCTGTCGT 5 7 00 
GCCAGCTG'CA TTAATGAATC GGCCAACGCG CGGGGAGAGG CGGTTTGCGT 5 750 
ATTGGGCGCT CTTCCGCTTC CTCGCTCACT GACTCGCTGC GCTC GG TCGT 5 800 

60 TCGGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGGCGGTA ATACGGT'TAT 5 850 
CCAGAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC AAAAGGCCAG 5900
CAAAAGGCCA GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG 5950
GCTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT £000 
GGCGAAACCC GACAGGACTA xAA^GATACC AGGCGTTTCC GCCTGGA&GC 6 050 
TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC 6100 
5 CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCATAGC TCACGCTGTA 6150 
GGTATCTCAG TTCGGTGTAG GTCGTOCGCT C CAAG CTGGG CTGTGTGCAC 6200 
GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT 6250 
TGAGTCCAAC CCGGTA&GAC ACGACTTATC GCCACTGGCA GCAGCCACTG 63 00 
GTAACAGGAT T AGO AG AG CG AGGT&TGTAG GCGGTGCTAC AGAGTTCTTG 6 3 50 

10 AAGTGGTGGC C T AACT ACGG CT AC ACT AGA AGG AC AGT AT TTGGTATCTG 64 00 
CGCTCTGCTG AAGCC&GTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT 
C CGG CAAAC A AACCACCGCT GGTAGCGGTQ GTTTTTTTGT TTGCAAGCAG 65 00 
CAGATTACGC GCAGAAAAAA AGO ATCT CAA QA&GATCCTT TGATCTTTTC 5550 
TAGGGGGTCT GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG 66 00 

1 5 TO ATGAGATT ATC AAAAAGG ATCTTGACGT AGATCCTTTT AAATTAAAAA 6 650 
TGAAGTTTTA AATCAATCTA AAGTATA.TAT GAGTAAACTT GGTCTGACAG 6 700 
TTACCAATGC TTAATCAGTG fcGGCACCTAT CTCAGCGATC TGTCTATTTC 6 750 
GT T CAT C CAT AGTTGCCTGA CTGCCCGTCG TGTAGATAAC TACGATACGG €8 00 
GAGGGCTTAC CATCTGGCCC CAGTGCTGCA ATGAT& CCGC GAGACCCACG 6 85 0 

20 CTCACCGGCT CCAGATTTAT CAGC&ATAAA CCAGCCAOCC GGAAGGGCCG 6900 
AGCGCAGAAG TGGTCCTGCA ACTTTATCCG CCTCCATCCA GTCTATTAAT 6950 
TGTTGCCGGG A AG CT AG AGT AAGT&GTT CG CCAGTTAA.TA GTTTGCGCAA 7 000 
CGTTGTTGCC ATTGCTAC&G GCATCGTGGT GTCACGCTCG TCGTTTGGXA 7050 
TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT TACATGATCC 7100 

25 CCCATGTTGT GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC CGATCGTTGT 7150 
CAGAAGTAAG TTGGCCGCAG TGTT AT C ACT CATGGTTATG GCAGCACTGC 72 00 
ATAATTCTCT TACTGTCATG CGATCCGTAA GATGCTTTTC TGTGACTGGT 72 50 
GAGTACTCAA CCAAGTCATT CTGAGAATAG TGTATGCGGC GACCGAGTTG 7300 
CTCTTGCCCG GCGTCAATAC GGGATAATAC CGCGCCACAT AGCAGAACTT 73 SO 

30 TAAAAGTGCT CAT CATTGG A AAACGTTCTT C GGGG CGAAft ACTCTCAAGG 7400 
ATCTTACCGC TGTTGAGATC CAGTTCGATG TAACCCACTC GTGCACCCAA 74 50 
CTGATCTTCA GCATCTTTTA CTTTCACCAG CGTTTCTGGG TG AGC Aft AAA 75 00 
CAGGAAGGCA AAATGCCGCA AJVAAAGGGAA TAAGGGCGAC ACGGAAATGT 7550 
TGAATACTCA TACTCTTCCT TTTTCAATAT TATTGAAGCA TTTATCAGGG 7600 

35 TTATTGTCTC ATGAGCGGAT ACATATTTGA ATGTATTTAG AAAAATAAAC 765 0 
AAATAGGGGT TCCGCGCACA TTTCCCCGAA AAGTGCCAC 7639 
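The sequence listing in this appendix is laid out in blocks of ten bases with a running position counter at the end of each line, and scanned copies may carry stray page line numbers and misread characters. The following is a minimal sketch for normalising one such line and flagging characters that are not A, C, G or T. The function name `clean_sequence_line` is a hypothetical helper, not part of the disclosure, and the sketch only flags suspect characters; it does not attempt to guess the intended base.

```python
import re

def clean_sequence_line(line):
    """Return (bases, suspects) for one line of a block-formatted listing.

    bases    -- the line with leading line number, trailing position
                counter and internal whitespace removed, upper-cased
    suspects -- list of (index, character) for anything other than A/C/G/T
    """
    body = re.sub(r"^\s*\d+\s+", "", line)      # leading page line number
    body = re.sub(r"(\s+\d+)+\s*$", "", body)   # trailing counter, possibly split
    bases = re.sub(r"\s+", "", body).upper()
    suspects = [(i, ch) for i, ch in enumerate(bases) if ch not in "ACGT"]
    return bases, suspects

if __name__ == "__main__":
    line = "CGCAGCGTGA CCGCTACACT TGCC&GCGCC CTAGCGCCCG CTCCTTTCGC 100"
    bases, suspects = clean_sequence_line(line)
    print(len(bases), suspects)
```

A counter that was itself misread as letters is not stripped by this sketch and will show up in the list of suspect characters, which is the intended conservative behaviour.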



SEQ ID NO:2 (pTnMod (CMV/Red))

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG SO 
CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100 
TTTCXTCCCT TCCTTTCTCG CCACGTTCGC CGG CATC AGA TTGGCTATTG 150 
GCCATTCCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGGT 2 00 

45 CATGTCCAAC ATTACCGCCA TGTTGAGATT GATTATTGAC TAGTTATTAA 23 0 
TAGTAATCAA TTACGGGG7.C ATT AG TTC AT AGCCCATATA TGGAGTTCCG 3 00 
CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 35 0 
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 4 00 
GGGACTTTCC ATTGACGT'GA. ATGGGTGGAG TATTTACGGT AAACTGC CCA 45 0 

50 CTTGGCAGTA CAT C AAGTGT AT CAT ATGCC AAGTACGCCC CCTATTGACG 500 
T C A AT G ACGG TAAATCGCCC GCCTGGCATT ATGCCCAGTA CATGAGCTTA 55 0 
TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 6 00 
CATGGTGATG CGGTTTTGG C AGTACATCAA TGGGCGTGGA TAGCGGTTTG €50 
ACTCACGGGG ATTTCCAAGT CTGCACCCCA TTGACGTCAA TGGGAGTTTG 7 00 

5t TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 7S0 
CCCATTGACG C AAATGGG CG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800 
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC 350 
TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 900 
GGAACGGTGC ATTGGAACGC GG ATTC CCC.G TGCCAAGAGT GACGTAAGTA 950 

60 CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000 
CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTTCCTTAT GCTATAGGTG 1050
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100
CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC ATGGCTCTTT 1150 
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 12 00 
TGACACGGAC TCTGTATTTT TACAGGATGG GGTCCC&TTT ATTATTTACA 1250 
5 AATTCACATA TACAACAACG CCGTCCCCCG TGCCCGCAGT TTTTAT T AAA 13 00 
CATAGCGTGG GATCTCCACG CGAATCTCGG QTACQTOTTC CGGACATGGG 13 SO 
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCOAGCCCT GGTCCCATGC 14 00 
CTCCAGCGGC TCATGGTCGC TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 1450 
CCAGACTTAG GCACAGCACA ATGCCCACCA CCACCAGTGT GCCGCACAAG 1500 

10 GCCGTGGCGG TAGGGTATGT GTGTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550 
CACGGCTGAC GGAGATG0AA OACTTAAGGC AGCGGCAGAA GAAGATGCAG 1600 
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 16 SO 
GTGGTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC 1700 
CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750 

15 CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 18 00 
TTACATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT T AAAAC G ACT 1850 
CAACAGCTTA ACGTTGGCTT GCCACGCATT ACTTGACTGT AAAACTCTCA 1900 
CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGCGAG AACAAAACAT 1950 
AACATCAAAC GAATCGACCG ATTGTTAQGT AATCGTCACC TCCACAAAGA 5000 

20 GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050 
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG CAAAAACGAC 2100 
TTATGGTATT GCGAGCTTCA GTCGCACTAC ACGGTCGTTC TGTTACTCTT 2150 
TATGAGAAAG CGTTCCCSCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 22 00 
CCAATTT OTA GCCGACCTTG CG&GCATTCT ACCGAGTAAC ACCACACCGC 2250 

25 TCATTGTCAG TGATGCTGOC 7TTAAAGXGC CATGGTATAA ATCCGTTGAG 23 00 
AAGCTGGGTT GGTACTGGTT AAGTCGAGTA AGAGG AAAAG TACA&TATGC 23 50 
AGACCTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CATGATATGT 24 00 
CATCTAGTCA CTCAAAGACT TTAGGCTATA AGAGG CTG AC TAAAAGCAAT 24 SO 
C C AATCT CAT GCCAAATTCT AXTQTATAAA TCTCGCTCTA AAGGCCGAAA 25 00 

30 AAATCAGCGC TCGACACGGA CTCATTGTCA CCACCCGTCA CCTAAAATCT 25 SO 
ACTCAGCGTC GGCAAAG GAG CCATGGGTTC TAG CAACTAA CTTACCTGTT 26 00 
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 26 SO 
GATTGAAGAA ACGTTCCGAG &CTTGA&AAG TCCTGCCTAC GGACT AGGCC 27 00 
TACGCCATAG CGGAAGGAGC AGCTCAGAG C GTTTTGATAT CATGCTGCTA 27 50 

35 ATCGCCCTGA TGCTTCAACT AACATGTTGG CTTGCGGGCG TTCATGCTCA 2800 
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATCGAA 2850 
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATT CTGGC 2900 
TACACAATAA CAAGGGAAGA CTTACTCGTG GCTGCA&CCC TACTAGCTCA 2950 
AAATT T ATTC ACACATGGTT ACGCTTTGGG GAAATTATGA TAATGATCCA 3 0 00 

40 GATCACTTCT GGCTAATAAA AGATCAGAGC TCTAGAGATC TGTGTGTTGG 3050 
TTTTTTGTGG ATCTGCTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 3100 
CCGTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT 3150 
TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 32 00 
CTATTCTGGG GGGTGGGGTG GGGCAGCAC& GCAAGGGGGA GGATTGGGAA 32 SO 

45 GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTCTATGG GTACCTCTCT 33 00 
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCGGTAC CTCTCTCTCT 33 SO 
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 34 00 
TTGACCCOGT GACCAAAGGT GCCTTTTATC ATCACTTTAA AAATAAAAAA 3 4 50 
CAATTACfCA GTGCCTGTTA TAAGCAGCAA TTAATTATGA TTGATGCCTA 3 5 00 

50 CATCACAACA AAAACTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3 550 
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 36 00 
CCTATCCAAG AAGTGATGCC TATCATTGGT TGGAATGAAC TTGAAAAAAA 3<S50 
TTAGCCTTGA ATA C ATT' ACT GGTAAGGTAA A CO C C ATTGT CAGCAAATTG 37 00 
ATCCAAGAGA ACCAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3 750 

55 OACCCTGAGC ACTGATGAAT CCCCTAATGA TTTTGGTAAA AATCATTAAG 38 00 
TTAAGGTGGA TAGACATCTT GTCATATGAT CCCGGTAATG TGAGTTAGCT 3850 
CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT 3 900 
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCT ATG AC 3 S S 0 
CATGATTACG CCAAGCGCGC AATTAACCCT CACTAAAGGG AACAAAAGCT 4 0 00 

60 GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGC 4 050 
ATCAGATTGG CTATTGGCCA TTGCATACGT TGTATCCATA TCATAATATG 4100
TACATTTATA TTGGCTCATG TCCAACATTA CCGCCATGTT GACATTGATT 4150
ATTC3 ACTAGT TATTAATAGT AATCAATTAC GGGGTCATTA GTTCATAGCC 4200 
CATATATGGA GTTCCGCGTT AC AT A ACT TA CGGTAAATGG CCCGCCTGGC 42 50 
TGACGGCCCA ACGACCCCCG GCCATTGACG TCAATAATGA CGTATGTTCC 4 3 00 
5 CATAGT&ACG CCAATAGGGA CTTTCCATTG A C GT C AATGG GTGGAGTAXT 4 3 50 
TACGGTAAAC TGG CC ACTTG GCAGTACATC AAGTGXATCA TATGCCAAGT 44 00 
AOGCCeeCTA TTGACGTCAA TGAGGGTAAA T0GCCCGCCT GGCATTATGC 4450 
CCAGTACATG ACCTTATGGG ACTTTCCTAC TTGGCAGTAC ATCTACGTAT 4 5 00 
TAGTCATCGC TATTACCATG GTGATGCGGT XTTGGCAGTA CAT CftATGGG 4 550 
10 CGTGGATAGC GGTTTGAGTG AGGQCSGATTT GCAAGTCTCC ACCCCATTGA 4 600 
CGTCAATGGG AGTTTGTTTT GGCACCA&AA TCAACGGGAC TTTCCAAAAT 4 €50 
GTCGTAACAA CTCCGCCCCA. . TTGACGCAAA TGGGCGGTAG GCGTGTACGG 4 7 00 
TGGGAGGTCT ATATAAGCAG AGCTCGT?TA GTGAACCGTC AGATCGCCTG 4 750 , 
GAGACGCCAT CCACGCTGTT TTGACCTCCA TAGAAGACAC CGGGACCGAT 4800 
15 CCAGCCTCCG CGGCCGGGAA CGGTGCATTG GAACGCGGAT TCCCCGTGCC 4 650 
AAGAGTGACG TAAGTACCGC CTATAGACTC TATAGGCACA CCCCTTTGGG 4 3 00 
TCTTATGCAT GCTATACTGT TTTTGGCTTG GGGCCTATAC ACCCCCGCTT 4 950 
CCXTATGCTA TAGGTGATGG TATAGCTTAG CCTATAGGTG TGGGTTATTG 50 00 
ACCATTATTG ACCACTCCCC TATTGGTGAC GATACTTTCC ATT ACT A AT C 5 050 

20 CATAACATGG CTCTTTGCCA CAAGTATCrC TATTGGCTAT ATGCCAATAC 5100 
TCTGTCCTTC AGAGACTGAC ACGGACTCTG TATTTTTACA GGATGGGGTC 5150 
CGATTTATTA TTT AC A&ATT CACATATACA ACAACGCCGT CCCCCGTGCC 5200 
CGCAGTTTTT ATTAAACATA GCGXGGGATC TGCACGCGAA TCTCGGGTAC 52-SC 
GTGTTCCGGA CATGGGCTCT TCTCCGGTAG CGGCGGftGCT TCCACATCCG S3 00 

25 AGCCCTGGTC CCATGCCTCC AGCGGCTCAT GGTCGCTCGG CAGCTCCTTG S3 SO 
CTCCTAACAG TGGAGGCCAG ACTT AGGGAC A GC AC A« TGC CCACCACCAC 54 00 
CAGTG.TGCCG CACAAGGCCG TGG CGGTAGG GTATGTGTCT GAAAATGAGC 54 50 
GTGGAGATTG GGCTCGCACG GCTGACGCAG ATGGAAGACT TAAGGCAGCG 55 00 
GCAGAAGAAG ATGCAGGCAG GTGAGTTGTT GTATTCTGAT AAGAGTCAGA 5550 

30 GGTAACTCCC GXT GCGGTGC TGTTAACGGT GGAGGGCAGT GTAGTCTGAG 56 00 
CAGTACTCGT TGCTGCCGCG CGCGCCACCA GACATAATAG C TGAC A G ACT 5650 
AAOAGACTGT TCCTTTCCAT GGGT CTT TTC TGCAGTCACC GTCTCGCGAC S700 
AGGGATCCAC CGGTCGCCAC CATGGTGCGC TCCTCGAAGA ACGTCATCAA 5750 
GGAGTTCATG CGCTTCAAGG TGCGCATGG A GGGCACCGTG AACGG CCA CG 5800 

35 AGTTCGAGAT CGAGGGCGAG GGCGAGGGCC GCCCCTACGA GGGCCACAAC 5 850 
AC CGTG AAGC TGAAGGTGAC CAAGGCCGGC CCCCTGCCCT TCGCCTGGGA 5900 
CATCCTGTCC CCCCAGTTCC AGTACGGCTC C AAGGYGT AC GTGAAGCACC 5S50 
CCGCCGACAT CCCCGACTAC AAGAAGCTGT CCXTCCCCGA GGGCTTCAAG 6000 
TGGGAGCGCG TGATGAACTT GGAGGACGGC GGCGTGGTGA CCGTGACCCA 6050 

40 GGACTCCTCC CTGCAGGAOG GCTGCTTCAT GTACAAGGTG AAGTT CAT CG 6100 
GCGTGAACTT CCCCTCCGAC GGCGCCGTAA TGCAGAAGAA GACCATGGGC 6150 
TGGGAGGCCT CCACCGAGCG OCTGTACCCC CGCGACGGCG TGCTGAAGGG 6500 
CGAGATCCAC AAGGCCCTGA AGCTGAAG3A CGGCGGCCAC TACCTGGTGG 6250 
AGTTCAAGTC CAT CT ACAT'G GCCAAGAAGC CCGTGCAGCT GCCCGQCT AC 63 00 

TACTACGTGG ACTCCAAGCT GGACATCACC TCCCACAACG AGGACTACAC 6350
CATCGTGGAG CAGTACGAGC GCACCGAGGG CCGCCACCAC CTGTTCCTGT 6400
AGCGGCCGCG ACTCTAGATC ATAATCAGCC ATACCACATT TGTAGAGGTT 6450
TTACTTGCTT TAAAAAACCT CCCACACCTC GCCCTGAACC TGAAACATAA 6500
AATGAATGCA ATTGTTGTTG TTAACTTGTT TATTGCAGCT TATAATGGTT 6550

ACAAATAAAG CAATAGCATC ACAAATTTGA CAAATAAAGC ATTTTTTTCA 6600
CTGCATTCTA GTTGTGGCCC GGGCTGCAGG AATTCGATAT CAAGCTTATC 6650
GATACCGCTG ACCTCGAGGG GGGGCCCGGT ACCCAATTCG CCCTATAGTG 6700
AGTCGTATTA CGCGCGCTCA CTGGCCGTCG TTTTACAACG TCGTGACTGG 6750
GAAAACCCTG GCGTTACCCA ACTTAATCGC CTTGCAGCAC ATCCCCCTTT 6800

CGCCAGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC 6850
AGTTGCGCAG CCTGAATGGC GAATGGAAAT TGTAAGCGTT AATATTTTGT 6900
TAAAATTCGC GTTAAATTTT TGTTAAATCA GCTCATTTTT TAACCAATAG 6950
GCCGAAATCG GCAAAATCCC TTATAAATCA AAAGAATAGA CCGAGATAGG 7000
GTTGAGTGTT GTTCCAGTTT GGAACAAGAG TCCACTATTA AAGAACGTGG 7050

ACTCCAACGT CAAAGGGCGA AAAACCGTCT ATCAGGGCGA TGGCCCACTA 7100
CTCCGGGATC ATATGACAAG ATGTGTATCC ACCTTAACTT AATGATTTTT 7150
ACCAAAATCA TTAGGGGATT CATCAGTGCT CAGGGTCAAC GAGAATTAAC 7200
ATTCCGTCAG GAAAGCTTAT GATGATGATG TGCTTAAAAA CTTACTCAAT 7250
GGCTGGTTAT GCATATCGCA ATACATGCGA AAAACCTAAA AGAGCTTGCC 7300
GATAAAAAAG GCCAATTTAT TGCTATTTAC CGCGGCTTTT TATTGAGCTT 7350
GAAAGATAAA TAAAATAGAT AGGTTTTATT TGAAGCTAAA TCTTCTTTAT 7400
CGTAAAAAAT GCCCTCTTGG GTTATGAAGA GGGTCATTAT ATTTCGCGGA 7450
ATAACATCAT TTGGTGACGA AATAACTAAG CACTTGTCTC CTGTTTACTC 7500
CCCTGAGCTT GAGGGGTTAA CATGAAGGTC ATCGATAGCA GGATAATAAT 7550
ACAGTAAAAC GCTAAACCAA TAATCCAAAT CCAGCCATCC CAAATTGGTA 7600

GTCAATGATT ATAAATAACA GCAAACAGTA ATGGGCCAAT AACACCGGTT 7650
GCATTGGTAA GGCTCACCAA TAATCCCTGT AAAGCACCTT GCTGATGACT 7700
CTTTGTTTGG ATAGACATCA CTCCCTGTAA TGCAGGTAAA GCGATCCCAC 7750
CACCAGCCAA TAAAATTAAA ACAGGGAAAA CTAACCAACC TTCAGATATA 7800
AACGCTAAAA AGGCAAATGC ACTACTATCT GCAATAAATC CGAGCAGTAC 7850

TGCCGTTTTT TCGCCCATTT AGTGGCTATT CTTCCTGCCA CAAAGGCTTG 7900
GAATACTGAG TGTAAAAGAC CAAGACCCGT AATGAAAAGC CAACCATCAT 7950
GCTATTCATC ATCACGATTT CTGTAATAGC ACCACACCGT GCTGGATTGG 8000
CTATCAATGC GCTGAAATAA TAATCAACAA ATGGCATCGT TAAATAAGTG 8050
ATGTATACCG ATCAGCTTTT GTTCCCTTTA GTGAGGGTTA ATTGCGCGCT 8100

TGGCGTAATC ATGGTCATAG CTGTTTCCTG TGTGAAATTG TTATCCGCTC 8150
ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA AAGCCTGGGG 8200
TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG 8250
CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATGGGCCAA 8300
CGCGCGGGGA GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT 8350

CACTGACTCG CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC 8400
ACTCAAAGGC GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA 8450
AAGAACATGT GAGCAAAAGG GCAGCAAAAG GCCAGGAACC GTAAAAAGGC 8500
CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA 8550
AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA 8600

TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC 8650
CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG 8700
CGCTTTCTCA TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT 8750
CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC CCGACCGCTG 8800
CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT 8850

TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT 8900
GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC 8950
TAGAAGGACA GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG 9000
GAAAAAGAGT TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC 9050
GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC 9100

TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG 9150
AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC 9200
ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT 9250
ATATGAGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC 9300
CTATCTCAGC GATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC 9350

GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG GCCCCAGTGC 9400
TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA 9450
TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAAGTTTA 9500
TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG 9550
TTCGCCAGTT AATAGTTTGC GCAACGTTGT TGCCATTGCT ACAGGCATCG 9600

TGGTGTCACG CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA 9650
CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGTTAG 9700
CTCCTTCGGT CCTCCGATGG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT 9750
GACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC 9800
GTAAGATGCT TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA 9850

ATAGTGTATG CGGCGACCGA GTTGCTCTTG CCCGGCGTCA ATACGGGATA 9900
ATACCGCGCC ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACGT 9950
TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC 10000
GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA 10050
CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG 10100

GGAATAAGGG CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA 10150
ATATTATTGA AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT 10200
TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC 10250
CGAAAAGTGC CAC 10263



SEQ ID NO: 3 (PTnMod (Oval/Red) Chicken)

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 50
CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100
TTTCTTCCCT TCCTTTCTCG CCACGTTGGG CGGCATCAGA TTGGCTATTG 150

GCGATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200
CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 250
TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCGCATATA TGGAGTTCCG 300
CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 350
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 400

GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 450
CTTGGCAGTA CATCAAGTGT ATCATATGGC AAGTACGCCC CCTATTGACG 500
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 550
TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 600
CATGGTGATG CGGTTTTGGC AGTAGATCAA TGGGCGTGGA TAGCGGTTTG 650

ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 700
TTTTGGCAGC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 750
CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC 850
TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 900

GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GACGTAAGTA 950
CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000
CTGTTTTTGG GTTGGGGGCT ATACACCCCC GCTTCCTTAT GCTATAGGTG 1050
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100
CCCCTATTGG TGACGATACT TTCCATTACT AATGCATAAC ATGGCTCTTT 1150

GCCACAACTA TCTCTATTGG CTATATGCGA ATACTCTGTC CTTCAGAGAC 1200
TGACACGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 1250
AATTCACATA TACAACAACG CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300
CATAGCGTGG GATCTCCACG CGAATCTCGG GTACGTGTTC CGGACATGGG 1350
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCCT GGTCCCATGC 1400

CTCCAGCGGC TCATGGTCGC TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 1450
CCAGACTTAG GCACAGCACA ATGCCCACCA CCACCAGTGT GCCGCACAAG 1500
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550
CACGGCTGAC GCAGATGGAA GACTTAAGGC AGCGGCAGAA GAAGATGCAG 1600
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650

GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC 1700
CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750
CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 1800
TTACATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT TAAAACGACT 1850
CAACAGCTTA ACGTTGGCTT GCCACGCATT ACTTGACTGT AAAACTCTCA 1900

CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGCGAG AACAAAACAT 1950
AACATCAAAC GAATCGACCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2000
GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG CAAAAACGAC 2100
TTATGGTATT GCGAGCTTCA GTCGCACTAC ACGGTCGTTC TGTTACTCTT 2150

TATGAGAAAG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 2200
CCAATTTCTA GCCGACCTTG CGAGCATTCT ACCGAGTAAC ACCACACCGC 2250
TCATTGTCAG TGATGCTGGC TTTAAAGTGC CATGGTATAA ATCCGTTGAG 2300
AAGCTGGGTT GGTACTGGTT AAGTGGAGTA AGAGGAAAAG TACAATATGC 2350
AGACCTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CATGATATGT 2400

CATCTAGTCA CTCAAAGACT TTAGGCTATA AGAGGCTGAC TAAAAGCAAT 2450
CCAATCTCAT GCCAAATTCT ATTGTATAAA TCTCGCTCTA AAGGCCGAAA 2500
AAATCAGCGC TCGACACGGA CTCATTGTCA CCACCCGTCA CCTAAAATCT 2550
ACTCAGCGTC GGCAAAGGAG CCATGGGTTC TAGCAACTAA CTTACCTGTT 2600
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2650

GATTGAAGAA ACCTTCCGAG ACTTGAAAAG TCCTGCCTAC GGACTAGGCC 2700
TACGCCATAG CCGAACGAGC AGCTCAGAGC GTTTTGATAT CATGCTGCTA 2750
ATCGCCCTGA
GAAACAAGGT 
ACGTACTCTC 






AAATTTATTC 
GATCACTTCT 
TTTTTTGTGG 
CCCTCCCCCG 
TTCCTAATAA 
CTATTCTGGG 
GACAATAGCA 
CTCTCTCTCT 
CTCTCTCTCT 
TTGACCCGGT 
CAATTACTCA 
CATCACAACA 
TATTTGAACA 
CCTATCCAAG 
TXAGCCTTGA 
ATCCAAGAGA 
GACCCTGAGC 
TTAAGGTGGA 
CACTCATTAG 
TGTGTGGAAT 

CATGATTACG 
GGAGCTCCAC 
AGGTCAGAAT 
A&CAATAGCT 
TCCCTCGAAC 
ATCTGCCAGG 
AGTTCATATC 
GAG CTATGTT 
ACACCCATAA 
CGTCTGCTCT 
TTATTTCTCC 
TTATGTCCTG 
TCAA&TG&AA 
ACTAACTAAX 
TTTCTGTTTT 
GAACATGAGC 
GATTAGGAGA 
T&TTTGCTCT 
CCCA&TCCCA 
GGAACCTGTG 
CGGATCTCCA 
TGTATTCAAG 
GCCCCATTGC 
GACAGGACCA 
CATGCGCTTC 
AGATCGAGGG 
AAGCTGAAGG 
GTCCCCCCAG 
ACATCCCGGA 
CGCCTGATGA 
CTCGCTGCAG 
AGTTCCCGTC 
GCCTCCACCO 
CCACAAGGCC 
AGTCCATCTA 

GTGGACTCCA 
GGAGCAGTAC 



TGCTTCAACT 
TGGGACAAGC 
AACAGTT CGC 
CAAGGGAAGA 
ACACATGGTT 
GGCTAATAAA 
ATCTGCTGTG 
TGCCTTCCTT 
AATGAGGAAA 
GGGTGGGGTG 
GGCATGCTQG 
CTCTCTCTCT 
CTCTCTCTCT 
G AC GAAAGGT 
G7GCCTGTTA 
AA&ACTGATT 
TTATCTTGAT 
AM3TG ATG CC 
ATACATT&CT 
ACCAACTTAA 
ACTGATGAAT 
TACACATCTT 
GCACCCCAGG 
TGTGAGCGGA 
CCAAGCGCGC 
CGCGGTGGCG 
GGTTTCTTTA 
TCTATAACTG 

CATGAACACT 
CCATTAAGTT 
AT AAA C A CAT* 

TTGCTGTATC 
AAAGATAGAT 
TCACTCTAGT 
TATTTTGTCA 
CCTAGCATGG 
CAGACTTCTG 
AATTGCTAAT 
CXTAAAGATC 
AATATTTCCC 
ACAGOCAGAA 
CCATTCAATC 
TTAAATGATT 
GGTGGGTCAC 
TGGGCTCCAT 
GAGCTCAAAG 
CAT CATGT C A 
QGGAATTCGT 
'AAGGTGCGCA 
CGAGGGCGAG 
TG&CC&AGGG 
TTCCAGTACG 
CTACAAGAAG 
ACTTCGAGGA 
GACGGCTGCT 
CGACGGCCCC 
AGGGCCTGTA 
CTGAAGCTGA 
CATGGCCAAG 
AGCTGGACAT 
GAG CG C AC CG 



AACATGTTGG 
ACTTCCAGGC 
TTAGGCATGG 
CTTACTCGTG 
ACGCTTTGGG 
AGATCAGAGC 
CCTTCTAGTT 
GACCCTGGAA 
TTGCATCGCA 
GGGCAGCACA 
GGATGCGGTG 
CTCTCTCTCT 
CTCTCTCTCT 
GCCTTTTATC 
TAAGCAGCAA 
TAACAAATGG 
TATATTATTG 
TATCATTGGT 
GGTAAGGTAA 
AGCTTTCCTG 
CGCCTAATGA 
GTCATATGAT 
CTTTACACTT 
TAACAATTTC 
AATTAACCCT 
GCCGCTCTAG 
CTGTTTGTCA 
AAATATATTT 
CCTCCAGCTG 
ATT CATGGAA 
TTGAAATTGA 
CTCAGAAAAA 
TTAAATATTC 
CTCAGTTGGC 
AGAAAATAAT 
CTCAGATGCA 
GTCTGTTACT 
TATGTTTTCC 
CCATTATCTG 
AGTCTTCTCT 
AACACATTGT 
CAAAATGGAC 
TCTATGGCGT 
AATTCAGGCT 
CGGTGCAGCA 
TCCACCATGC 
GCTCTAGCC& 
GCGCTCCTCC 
TGGAGGGCAC 
GGCCGCCCCT 
CGGCCCCCTG 
GCTCCA&GGT 
CTGTCCTTCC 

GTAATGCAGA 
CCCCCGCGAC 
AGGACGGCGG 
AAGCCCGTGC 
CACCTCCCAC 
AGGGCCGCCA 



CTTGCGGGCG 
TAACACAGTC 
AAGTTTTGCG 
GCTGCAACCC 
GAAATTATGA 
T CT AG AG AT C 
GCCAGCCATC 
GGTGCCACTC 
TTGTCTGAGT 
GCAAGGGGGA 
GGCTCTATGG 
CTCTCGGTAC 
CGGTACCAGG 

TTAATTATGA 
TTGGTCTGCC 
ATAATAATAA 

TGG.AATGAAC 
ACGCCATTGT 

ACGGAATGTT 
TTTTGGTAAA 
CCCGK3TAATG 
TATGCTTCCG 
ACACAGGAAA 
CACTAAAGGG 
AACTAGTGGA 
ATTCTATTAT 
GCTATTGTAT 

AATTTCACAA 
GATCTTTGAG 
GTATTGTTTT 
AAGTTTGTTA 
CAGCTATAGG 
TCCTTCACAT 
AGGTCACGTC 
GGTTGTAGAT 
ACAACCATAG 
ATCTCTAAGG 
GXTGTAACTG 
CCCATCCAAC 
TACCCAGAAT 



TTCATGCTCA 



XATTGAAAC 



ATATATTCCC 
AGCATGGAAT 
CAATGAGAAC 
TGGTATACCT 
AAGAACGTCA 
CGTGAACGGC 
ACGAGGGCCA 
CCCTTCGCCT 
GTACGTGAAG 
CCGAGGGCTT 
GTGACCGTGA 



AGAAGACCAT 
GGCGTGCTGA 
CCACTACCTG 
AGCTGCCCGG 



CCACCTGTTC 




T ACT AG CT CA 
TAATGATCCA 
TGTGTGTTGG 
TGTTGTTTGC 
CCACTGTCCT 
AGGTGTCATT 
GGATTGGGAA 
GTACCTCTCT 
CTCTCTCTCT 
TGCTGAAGAA 
AAATAAAAAA 
TTGATGCCTA 
TTAGAAAGTA 
AAACCTTATC 
TTGAAAAAAA 
CAGCAAATTG 
AATTCTCGTT 

AATCATTAAG 
TGAGTTAGCT 

GCrGGTATGT 
CAGCTATGAC 
AACAAAAGCT 

TTCAATACAG 
ATTATGATTG 
TTCCTCTGTC 
GAACACTGCA 
GCATTGTATG 
TAAAGCATTC 
AAAGAAAGTG 
GCATGCTTCT 
TTGTTCTCAC 
ACAAGAAGGA 
TAATAAGCAC 
TT C C C AC ATT 
AAGCTCAATG 
AGTCCTGATG 
TAAAAACTAA 
TAAAATCTAA 
ACTTCTGAAG 
CAGGGCTCAG 

TTTGTTTTOA 
ATCTTCTACT 
GGGTGCAAAA 
TC AAGGAG TT 
CACGAGTTCG 
CA AC A C C G TG 
GGGACATCCT 
CACCCCGCCG 
CAAGTGGGAG 
CCCAGGACTC 
ATCGGCGTGA 
GGGCTGGGAG 
AGGGCGAGAT 
GTGGAGTTGA 
CT ACT ACT AC 
ACACCATCGT 
CTGTAGCGGC 



2800 
2850 
2900 
2950 
3000 
3050 
3100 
3150 
3200 

3250
3300
3350
3400
3500
3550
3600
3650
3700
3750
3800
3850
3900
3950
4000
4050
4100
4150
4200
4250
4300
4350
4400
4450

4500
4550
4600
4650
4700
4750
4800
4850
4900
4950
5000
5050
5100
5150
5300
5350
5400
5450
5550
5600 
5650 
5700
CGCGACTCTA GATCATAATC AGCCATACCA CATTTGTAGA GGTTTTACTT 5850
GCTTTAAAAA ACCTCCCACA CCTGCCCCTG AACCTGAAAC ATAAAATGAA 5900
TGCAATTGTT GTTGTTAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT 5950
AAAGCAATAG CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT 6000
TCTAGTTGTG GCTCGAGAAG GGCGAATTCT GCAGATATCC ATCACACTGG 6050
CGGCCGCTCG AGGGGGGGCC CGGTACCCAA TTCGCCCTAT AGTGAGTCGT 6100
ATTACGCGCG CTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC 6150
CCTGGCGTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG 6200
CTGGCGTAAT AGCGAAGAGG CCGGCACCGA TCGCCGTTCC CAACAGTTGC 6250
GCAGCCTGAA TGGCGAATGG AAATTGTAAG CGTTAATATT TTGTTAAAAT 6300
TCGCGTTAAA TTTTTGTTAA ATCAGCTCAT TTTTTAACCA ATAGGCCGAA 6350
ATCGGCAAAA TCCCTTATAA ATCAAAAGAA TAGACCGAGA TAGGGTTGAG 6400
TGTTGTTCCA GTTTGGAACA AGAGTCCACT ATTAAAGAAC GTGGACTCCA 6450
ACGTCAAAGG GCGAAAAACC GTCTATCAGG GCGATGGCCC ACTACTCCGG 6500
GATCATATGA CAAGATGTGT ATCCACCTTA ACTTAATGAT TTTTACCAAA 6550
ATCATTAGGG GATTCATCAG TGCTCAGGGT CAACGAGAAT TAACATTCCG 6600
TCAGGAAAGC TTATGATGAT GATGTGCTTA AAAACTTACT CAATGGCTGG 6650
TTATGCATAT CGCAATACAT GCGAAAAACC TAAAAGAGCT TGCCGATAAA 6700
AAAGGCCAAT TTATTGCTAT TTACCGCGGC TTTTTATTGA GCTTGAAAGA 6750
TAAATAAAAT AGATAGGTTT TATTTGAAGC TAAATCTTCT TTATCGTAAA 6800
AAATGCCCTC TTGGGTTATC AAGAGGGTCA TTATATTTCG GGGAATAACA 6850
TCATTTGGTG ACGAAATAAG TAAGCACTTG TCTCCTGTTT ACTCCCCTGA 6900
GCTTGAGGGG TTAACATGAA GGTCATCGAT AGCAGGATAA TAATACAGTA 6950
AAACGCTAAA CCAATAATCC AAATGCAGCC ATCCCAAATT GGTAGTGAAT 7000
GATTATAAAT AACAGCAAAC AGTAATGGGC CAATAACACC GGTTGCATTG 7050
GTAAGGCTCA CCAATAATCC CTGTAAAGCA CCTTGCTGAT GACTCTTTGT 7100
TTGGATAGAC ATCACTCCCT GTAATGCAGG TAAAGCGATC CCACCACCAG 7150
CCAATAAAAT TAAAACAGGG AAAACTAACC AACCTTCAGA TATAAACGCT 7200
AAAAAGGCAA ATGCACTACT ATCTGCAATA AATCCGAGCA GTACTGCCGT 7250
TTTTTCGCCC CATTTAGTGG CTATTCTTCC TGCCACAAAG GCTTGGAATA 7300
CTGAGTGTAA AAGACCAAGA CCCGCTAATG AAAAGCCAAC CATCATGCTA 7350
TTCCATCCAA AACGATTTTC GGTAAATAGC ACCCACACCG TTGCGGGAAT 7400
TTGGCCTATC AATTGCGCTG AAAAATAAAT AATCAACAAA ATGGCATCGT 7450
TTTAAATAAA GTGATGTATA GCGAATTCAG CTTTTGTTCC CTTTAGTGAG 7500
GGTTAATTGC GCGCTTGGCG TAATCATGGT GATAGCTGTT TCCTGTGTGA 7550
AATTGTTATC CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA 7600
GTGTAAAGCC TGGGGTGCCT AATGAGTGAG CTAACTCACA TTAATTGCGT 7650
TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG CCAGCTGCAT 7700
TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC 7750
TTCCGCTTCC TCGCTCACTG AGTGGGTGCG CTCGGTCGTT CGGCTGCGGC 7800
GAGCGGTATC AGCTCACTCA AAGGGGGTAA TACGGTTATC CACAGAATCA 7850
GGGGATAACG CAGGAAAGAA GATGTGAGCA AAAGGCCAGC AAAAGGCCAG 7900
GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC 7950
CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG 8000
ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG 8050
CTCTCCTGTT CCGACCCTGC CCCTTACCGG ATACCTCTCC GCCTTTGTCC 8100
CTTCGGGAAG CGTGGCGCTT TCTCATAGCT CACGCTGTAG GTATCTCAGT 8150
TCGGTGTAGG TCGTTCGGTG CAAGCTGGGC TGTGTGCACG AACCCCCCGT 8200
TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATGGTCTT GAGTCCAACC 8250
CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT 8300
AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC 8350
TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 8400
AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTGTTGATC CGGCAAACAA 8450
ACCACCGCTG GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG 8500
CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG 8550
ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA 8600
TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA 8650
ATCAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT 8700
TAATCAGTGA GGGACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA 8750
GTTGCCTGAC TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC 8800
ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC TCACCGGCTC 8850
CAGATTTATC AGCAATAAAC CAGCGAGCCG GAAGGGCCGA GCGCAGAAGT 8900
GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGCCGGGA 8950
AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 9000
TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC 9050
AGCTCCGGTT CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG 9100
CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC AGAAGTAAGT 9150
TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT 9200
ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC 9250
CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 9300
CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC 9350
ATCATTGGAA AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT 9400
GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC TGATCTTCAG 9450
CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA 9500
AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT 9550
ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA 9600
TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT 9650
CCGCGCACAT TTCCCCGAAA AGTGCCAC 9678



SEQ ID NO: 4 (PTnMod (Oval/Red) Quail)

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 50 
CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100 
TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCATCAGA TTGGCTATTG 150 

GCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200
CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 250
TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 300
CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACGG CCCAACGACC 350
CCCGCCCATT GAGGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 400

GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 450
CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG 500
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 550
TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 600
CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 650

ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 700
TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 750
CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC 850
TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 900

GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GACGTAAGTA 950
CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000
CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTTCCTTAT GCTATAGGTG 1050
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACGATT ATTGACCACT 1100
CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC ATGGCTCTTT 1150

GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 1200
TGACACGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 1250
AATTCAGATA TACAACAACG CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300
CATAGCGTGG GATGTCCACG CGAATCTCGG GTACGTGTTC CGGACATGGG 1350
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCCT GGTCCCATGC 1400

CTCCAGCGGC TCATGGTCGC TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 1450
CCAGACTTAG GCACAGCACA ATGCCCACCA CCACCAGTGT GCCGCACAAG 1500
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550
CACGGCTGAC GCAGATGGAA GACTTAAGGC AGCGGCAGAA GAAGATGCAG 1600
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650

GTGCTGTTAA GGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC 1700
CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750
CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 1800
TTACATGATT CTCTTTACCA ATTCTGCCCG GAATTACACT TAAAACGACT 1850
CAACAGCTTA ACGTTGGCTT GCCACGCATT ACTTGACTGT AAAACTCTCA 1900

CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGGGAG AACAAAACAT 1950
AACATCAAAC GAATCGACCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2000






GCGACTCGCT 
GATGCCCATT 
TTATOGTATT 
TATGAGAAAG 
CCAATTTCTA 
TCATTGTCAG 
AA G CTGGSGTT 
AGACCTAGGA 
CATCTAGTCA 
CCAATCTCAT 
AAATCAGCGC 
AC T CAGCGTC 
GAAATTCGAA 
GATTGAAG&A 
TACGCCAT&G 
ATCGCCCTGA 
GAAACAAGGT 
ACGTACTCTC 
TACACAATAA 
AAATTTATTC 
GATCACTTCT 
TTTTTTGTGG 
CCCTCCCCCG 
TTCCTAATAA 
CT ATT CTGGG 
GACAATAGCA 
CTCTCTCTCT 
CTCTCTCTCT 
TTGACCCGGT 
CAATT ACT CA 
CATCACAACA 
TATTTGAACA 
CCTATCCAAG 
TTAGCCTTGA 
ATCCAAGAGA 

GACCCTGAGC 
TTAAGGTGGA 
CACTCATTAG 
TGTGTGGAAT 
CATGATTACG 
GGAGCTCCAC 
AGGTCAGAAT 
AACAAAAGCT 
TCCCTCGAAC 
ATCTGCCAGG 
CAT AC CATAA 
TATGTTTTGC 
CCATCA&AAG 
TCTTCACTCT 



CCTATTTTGT 
TGCCTAGCAT 
AACAGACTTC 
ATAATTGCTA 
TTAAGATCCC 
TATTTCTCAG 



GTATACCGTT 

GTACTTGTTG 

GCGAGCTTCA 

CGTTCCCGCT 

GCCGACCTTG 

TGATGCTGGC 

GGTACTGGTT; 

GCGGAAAACT 

CTCAAAGACT 

GCCAAAXTCT 

TCGACACGGA 

GGCAAAGGAG 

CA C C A^i^^.GA 

ACCTTCCGAG 

CCG&ACGAGC 

TGCTTCAACT 

TGGG&CA&GC 

AACAGTTCGC 

C&AGGGAAGA 

ACACATGGTT 

GGCTAATAAA 

ATCTGCTGTG 

TGCCTTCCTT 

AATGAGGAAA 



GGGATGCTAG 
ACTGGTCTOA 
GT CG C ACT AC 
TTCAGAGCAA 
CG AG C ATT GT 
TTTAAAGTGC 
AAGTCGAGTA 
GGAAACCTAT 
TTAGGCTATA 
ATTGTATAAA 
CTCATTGTCA 
CCATGGGTTC 
ACTTGTTAAT 
ACTTGAAAAG 
AGCTCAGAGC 
AACATGTTGG 
ACTTCCAGGC 
TTAGGCATGG 
CTTACTCGTG 
ACGCTTTGGG 
AGATGAGAGC 



GGv? 



^TGGSGTG 



GGCATGCTGG 
CTCTCTCTCT 
CTCTCTCTCT 
GACCAAAGGT 
GTGCCTGTTA 
AAAACTGATT 
TTATCTTGAT: 
ftAGTGATGC C 
ATACATTACT 
ACCAACTTAA 
ACTG&TGAAT 
TACACATCTT 

GCACCCCAGG 
TGTGAGCGGA 
CCAAGCGCGC 
CGCQGTGGCQ 
GGTTTCTTTA 
TCTATAACTG 
CATGAACACT 
CTGGAAG&TC 
ACTCATTTGG 
AGTTCCCTCA 

ATATATTTAA. 

GAT CTCAGTT 

CAAGAAAATA 

GGCTTAGATG 

TGGTCTGTTA 

ATTATGTTTT 

ATTATCTGGT 

TCTTTTCTCC 



GACC.OTGGAA 
TTGCATCGCA 
GGGCAGCACA 
GGATGCGGXG 
CTCTCTCTCT 
CTCTCTCTCT 



G 



TAAGCAGCAA 
TAA C AAATGG 
TATATTATTG 
TATCATTGGT 
GGXAAGGTAA 
AGCTTTCCTG 



~AATGA 



GAAAACACTT TOTT ACCCA< 
ATCCAAAATG 
ATTTCTATGG 



CCCAATTCAG 
CTCGTGCAGC 
GTCCACCATG 



GACCTATTGA 
CGTCAAAGGT 
GCTATATATT 
AAGCATGGAA 
CCAATGACAA 



GTCATATGAT 
CTTTACACTT 
TAACAATTTC 
A ATT A AC CCT 
GCCGCTCTAG 
CTGTTTGTCA 
AAATATATTT 
CCTCCAGCTG 
ATGGAAGATC 
AATTGAGTAT 
GAAGAAAAGC 
AT &TT.OC A AC 
GGTTTCTTCA 

ATAGGXCAAG 
CACGTTGTAC 
CAACAACCAT 
CC AT CTCTAA 
TGTAACTGAA 
AGCAATCCTC 
AATTAAAAAC 
AACTAAAATC 
CAAACTTTTG 
CCGCAGGGGT 
TTTTGCCTTG 
CATGCTCTAC 



CTTTATCTGT 
TATTCGTGAG 
ACGGTCGTTC 
TCTTCAAAGA 
ACCGAGTAAC 
CATGGTATAA 
AG AGG AAAAG 
CAGCAACTTA 
AG&GGCTGAC 
TCTCG CTCTA 
CCACCCGTCA 
T AGG AACT AA 
ATCTATTCGA 
TCGTGCCTAC 
GTTTTGATAT 

TAACACAGTC 
AAGTTTTGCG 
GCTGCAACCC 
GAAATTATGA 
TCTAGAGATC 
GCCAGCCATC 
GGTGCCACTC 
TTGTCTGAGT 
GCAAGGGGGA 
GGCTCTATGG 

X C i Cv^-r A AC 

CGGTACCAGG 

ATCACTTTAA 

TTAATTATGA 

TTGGTCTGCC 

ATAAT AATAA 

TGGAATGAAC 

ACGCCATTGT 

ACGGAATGTT 

TTTTGGTAAA 

CCCGGTAATG 

TATGCTTCCG 

ACACAGGAAA 

CACTAAAGGG 

AACTAGTGGA 

ATTCTATTAT 

GCTATTGTAT 

AATTTCACAA 

TCTGAGGAAC 

TATTTTGCTT 

TTGTTATAAA 

TACAGAAAGA 

CGTACATGCT 

TCCTGTTCTC 

ATTCAAGAAG 

AGTAATAAAC 

GGTTGCCACA 

GCTCAATGGA 

ACGGATTAGA 

TAATATTTGC 

TGACCCAATC 

AAGGGAACCT 

GAGCGGATCT 

ATGTATTGAA 

TCCCCCTTTG
TCGGGAATAC
C A AAA A C G A C 
TGTTACTC1T 
AAGCTCATGA 
ACCACACCGC 
ATCCGTTGAG 
TACAATATGC 
CATGATATGT 
TAAAAGCAAT 
AAGGCCGAAA 
CCTAAAATCT 
CTTACCTGTT 
AGCGAATGCA 
GGACTAGGCC 
CATGCTGCTA 
TTCATGCTCA 
AGAAATCGAA 
GCATTCTGGC 
TACTAGCTCA 
TAATGATCCA 
TGTGTG TTGG 
TGTTGTTTGC 
CCACTGTCCT 
AGGTGTCATT 
GGATTGGGAA 
GTACCTCTCT 
CTCTCTCTCT 
TGCTGAAGAA 

TTGATGCCTA 
TTAGAAAGTA 
AAACCTTATC 
TTGAAAAAAA 
CAGCAAATOG 
AATTCTCGTT 
AATCATTAAG 
TGAGTTAGCT 
GCTCGTATGT 
C AG CTATGAC 
AACAAAAGCT 
TCCCCCGGGG 
TTCAATACAG 
ATTATGATTG 
TTCCTCTGTC 
ATTGCAAGTT 
TGAATGGAGC 
GCGTCTAGftG 
TTTTGTCTGC 
T CTTTATTTG 
ACTTATCTCC 
GATCAAATGA 
AGACTAACTA 
TTTTTCTGTT 
ACATGAACAG 
AGAACTGGCA 
TCTCCCTTCA 
CCATTAAATT 
GTGGGTGGGT 
CCATGGGCTC 
GGAGCTCAAA 
CCATCTGTCA
ACTCTGGCCA TGGTCTCCCT GGGTGCAAAA GACAGCACCA GGGAATTCGT 5100
GCGCTCCTCC AAGAACGTCA TCAAGGAGTT CATGCGCTTC AAGGTGCGCA 5150
TGGAGGGCAC CGTGAAGGGC CACGAGTTCG AGATCGAGGG CGAGGGCGAG 5200
GGCCGCCCCT ACGAGGGCCA CAACACCGTG AAGCTGAAGG TGACCAAGGG 5250
CGGCCCCCTG CCCTTCGCCT GGGACATCCT GTCCCCCCAG TTCCAGTACG 5300
GCTCCAAGGT GTACGTGAAG CACCCCGCCG ACATCCCCGA CTACAAGAAG 5350
CTGTCCTTCC CCGAGGGCTT CAAGTGGGAG CGCGTGATGA ACTTCGAGGA 5400
CGGCGGCGTG GTGACCGTGA CCCAGGACTC CTCCCTGCAG GACGGCTGCT 5450
TCATCTACAA GGTGAAGTTC ATCGGCGTGA ACTTCCCCTC CGACGGCCCC 5500

GTAATGCAGA AGAAGACCAT GGGCTGGGAG GCCTCCACCG AGCGCCTGTA 5550
CCCCCGCGAC GGCGTGCTGA AGGGCGAGAT CCACAAGGCC CTGAAGCTGA 5600
AGGACGGCGG CCACTACCTG GTGGAGTTCA AGTCCATCTA CATGGCCAAG 5650
AAGCCCGTGC AGCTGCCCGG CTACTACTAC GTGGACTCCA AGCTGGACAT 5700
CACCTCCCAC AACGAGGACT ACACCATCGT GGAGCAGTAC GAGCGCACCG 5750

AGGGCCGCCA CCACCTGTTC CTGTAGCGGC CGCGACTCTA GATCATAATC 5800
AGCCATACCA CATTTGTAGA GGTTTTACTT GCTTTAAAAA ACCTCCCACA 5850
CCTCCCCCTG AACCTGAAAG ATAAAATGAA TGCAATTGTT GTTGTTAACT 5900
TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG CATCACAAAT 5950
TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG GCTCGAGAAG 6000

GGCGAATTCT GCAGATATCC ATCACACTGG CGGCCGCTCG AGGGGGGGCC 6050
CGGTACCCAA TTCGCCCTAT AGTGAGTCGT ATTACGCGCG CTCACTGGCC 6100
GTCGTTTTAG AACGTCGTGA CTGGGAAAAC CCTGGCGTTA CCCAACTTAA 6150
TCGCCTTGCA GCACATCCCC CTTTGGCCAG CTGGCGTAAT AGCGAAGAGG 6200
CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG 6250

AAATTGTAAG CGTTAATATT TTGTTAAAAT TCGCGTTAAA TTTTTGTTAA 6300
ATCAGCTCAT TTTTTAACCA ATAGGCCGAA ATCGGCAAAA TCCCTTATAA 6350
ATCAAAAGAA TAGAGCGAGA TAGGGTTGAG TGTTGTTCCA GTTTGGAACA 6400
AGAGTCCACT ATTAAAGAAC GTGGACTCCA ACGTCAAAGG GCGAAAAACC 6450
GTCTATCAGG GCGATGGCCC ACTACTCCGG GATCATATGA CAAGATGTGT 6500

ATCCACCTTA ACTTAATGAT TTTTACCAAA ATCATTAGGG GATTCATCAG 6550
TGCTCAGGGT CAACGAGAAT TAACATTCCG TCAGGAAAGC TTATGATGAT 6600
GATGTGCTTA AAAACTTACT CAATGGCTGG TTATGCATAT CGCAATACAT 6650
GCGAAAAACC TAAAAGAGCT TGCCGATAAA AAAGGCCAAT TTATTGCTAT 6700
TTACCGCGGC TTTTTATTGA GCTTGAAAGA TAAATAAAAT AGATAGGTTT 6750

TATTTGAAGC TAAATCTTCT TTATCGTAAA AAATGCCCTC TTGGGTTATC 6800
AAGAGGGTCA TTATATTTCG CGGAATAACA TCATTTGGTG ACGAAATAAC 6850
TAAGCACTTG TCTCCTGTTT ACTCCCCTGA GCTTGAGGGG TTAACATGAA 6900
GGTCATCGAT AGCAGGATAA TAATACAGTA AAACGCTAAA CCAATAATCC 6950
AAATCCAGCC ATCCCAAATT GGTAGTGAAT GATTATAAAT AACAGCAAAC 7000

AGTAATGGGC CAATAACACC GGTTGCATTG GTAAGGCTCA CCAATAATCC 7050
CTGTAAAGCA CCTTGCTGAT GACTCTTTGT TTGGATAGAC ATCACTCCCT 7100
GTAATGCAGG TAAAGCGATC CCACCACCAG CCAATAAAAT TAAAACAGGG 7150
AAAACTAACC AACCTTCAGA TATAAACGCT AAAAAGGCAA ATGCACTACT 7200
ATCTGCAATA AATCCGAGCA GTACTGCCGT TTTTTCGCCC CATTTAGTGG 7250

CTATTCTTCC TGCGACAAAG GCTTGGAATA CTGAGTGTAA AAGACCAAGA 7300
CCCGCTAATG AAAAGCCAAC CATCATGCTA TTCCATCCAA AACGATTTTC 7350
GGTAAATAGC ACCCACACCG TTGCGGGAAT TTGGCCTATC AATTGCGCTG 7400
AAAAATAAAT AATCAACAAA ATGGCATCGT TTTAAATAAA GTGATGTATA 7450
CCGAATTCAG CTTTTGTTCC CTTTAGTGAG GGTTAATTGC GCGCTTGGCG 7500

TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 7550
TCGACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT 7600
AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC 7650
CAGTCGGGAA ACGTGTGGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC 7700
GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC TCGCTCACTG 7750

ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA 7800
AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG CAGGAAAGAA 7850
CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT 7900
TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT 7950
CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA 8000

GGCGTTTCCC CCTGGAAGCT CCGTCGTGCG CTCTCCTGTT CCGACCCTGC 8050
CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT 8100
TCTCATAGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC 8150
CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 8200
TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG 8250
CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG 8300
CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC TACACTAGAA 8350
GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA 8400
AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG GTAGCGGTGG 8450
TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG 8500
AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC 8550

TCACGTTAAG GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA 8600
GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA AGTATATATG 8650
AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC 8700
TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT 8750
GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC ACTGCTGCAA 8800

TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC 8850
CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC 8900
CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC 8950
CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 9000
TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC 9050

AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 9100
TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC 9150
ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG 9200
ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT 9250
GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC 9300

GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACCTTCTTC 9350
GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT 9400
AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC 9450
GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGGCGGAA AAAAGGGAAT 9500
AAGGGGGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT TTTCAATATT 9550

ATTGAAGCAT TTATCAGGGT TATTGTCTGA TGAGCGGATA CATATTTGAA 9600
TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA 9650
AGTGCCAC 9658



SEQ ID NO: 5 (spacer)
(GPGG)x

SEQ ID NO: 6 (spacer)
GPGGGPGGGPGG



SEQ ID NO: 7 (spacer)
GGGGSGGGGSGGGGS



SEQ ID NO: 8 (spacer)
GGGGSGGGGSGGGGSGGGGS



SEQ ID NO: 9 (enterokinase cleavage site)

DDDDK



SEQ ID NO: 10 (altered transposase Hef forward primer)
ATCTCGAGACCATGTGTGAACTTGATATTTTACATGATTCTCTTTACC



SEQ ID NO: 11 (altered transposase Her reverse primer)
GATTGATCATTATCATAATTTCCCCAAAGCGTAACC



SEQ ID NO: 12 (Xho I restriction site)
CTCGAG



SEQ ID NO: 13 (modified Kozak sequence)
ACCATG 



SEQ ID NO: 14 (Bcl I restriction site)
TGATCA



SEQ ID NO: 15 (CMVf-NgoM IV primer)
TTGCCGGCATCAGATTGGCTAT



SEQ ID NO: 16 (Syn-polyAr-BstE II primer)
AGAGGTCACCGGGTCAATTCTTCAGCACCTGGTA



SEQ ID NO: 17 (vitellogenin promoter) 

TGAATGTGTT CTTGTGTTAT CAATATAAAT CACAGTTAGT GATGAAGTTG GCTGCAAGCC 
TGCATCAGTT CAGCTACTTG GCTGCATTTT GTATTTGGTT CTGTAGGAAA TGCAAAAGGT 
TCTAGGCTGA CCTGCACTTC TATCCCTCTT GCCTTACTGC TGAGAATCTC TGCAGGTTTT 
AATTGTTCAC ATTTTGCTCC CATTTACTTT GGAAGATAAA ATATTTACAG AATGCTTATG 
AAACCTTTGT T C ATTTAA A A ATATTCCTGG TCAGCGTGAC CGGAGCTGAA AGAACACATT 
GATCCCGTGA TTTCAATAAA TACATATGTT C C AT ATATTG TTTCTCAGTA G COT CTX AAA 
TCATGTGCGT TGGTGCACAT ATGAATACAT GAATAGCAAA GGTTTAT CTG OATTACGCTC 
TGGCCTGCAG GAATGGCCAT A&ACCAAAGC TGAGGGAAGA GGGAGAGTAT AGTCAATGTA 
GATTATACTG ATTGCTGATT GGGTTATTAT C AG CT AG AT A ACAACTTGGG TCAGGTGCCA 
GGTCAACATA ACCTGGGCAA AACCAGTCTC ATCTGTGGCA GGACCATGTA CCAGCAGCCA 
GCCGTGACCC AATCTAGGAA AGCAAGTAGC ACATC.AATTT TAAATTTATT GTAAATGCCG 
TAGTAGAAGT GTTTTACTGT GAT AC ATT G A AACTTCTGGT CAATCAGAAA AAGGTTTTTT 
ATCAGAGATG CCAAGGTATT ATTTGATTTT CTTTATTCGC CGTGAAGAGA ATTTATGATT 
GCAAAAAGAG GAGTGTTTAC ATAAACTGAT AAAAAACTTG AGGAATTCAG CAGAAAACAG 
CCACGTGTTC CTGAACATTC TTCCATAAAA GTCTCACCAT GCCTGGCAGA GCCCTATTCA
CCTTCGCT 



SEQ ID NO: 18 (vitellogenin targeting sequence)

ATGAGGGGGATCATACTGGCATTAGTGCTCACCCTTGTAGGCAGCCAGAAGTTTGACATTGGT 



SEQ ID NO: 19 (p146 protein)
KYKKALKKLAKLL



SEQ ID NO: 20 (p146 coding sequence)
AAATACAAAAAAGCACTGAAAAAACTGGCAAAACTGCTG
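The relationship between the p146 coding sequence and the p146 protein can be checked by translating the coding sequence with the standard genetic code. The sketch below is illustrative only: the transcription of SEQ ID NO: 20 used here is an assumption (39 nucleotides, read in frame from the first base), and `translate` is a hypothetical helper, not part of the listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest), one amino-acid letter per codon; '*' is stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring spaces and any trailing partial codon."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# ASSUMED transcription of SEQ ID NO: 20 (p146 coding sequence).
P146_CODING = "AAATACAAAAAAGCACTGAAAAAACTGGCAAAACTGCTG"

if __name__ == "__main__":
    print(translate(P146_CODING))
```

With this transcription, the translation gives the 13-residue peptide KYKKALKKLAKLL.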



SEQ ID NO: 21 (pro-insulin sequence)

TTTGTGAACCAACACCTGTGCGGCTCACACCTGGTGGAAGCTCTCTACCTAGTGTGCGGGGAACGAGGC
TTCTTCTACACACCCAAGACCCGCCGGGAGGCAGAGGACCTGCAGGTGGGGCAGGTGGAGCTGGGCGGG
GGCCCTGGTGCAGGCAGCCTGCAGCCCTTGGCCCTGGAGGGGTCCCTGCAGAAGCGTGGCATTGTGGAA
CAATGCTGTACCAGCATCTGCTCCCTCTACCAGCTGGAGAACTACTGCAACTAG



SEQ ID NO: 22 (TAG sequence)

Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp
Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Thr Thr Cys Ile Leu Lys Gly Ser Cys
Gly Trp Ile Gly Leu Leu Asp Asp Asp Asp Lys



SEQ ID NO: 23 (gp41 epitope)

Ala Thr Thr Cys Ile Leu Lys Gly Ser Cys Gly Trp Ile Gly Leu Leu



SEQ ID NO: 24 (polynucleotide sequence encoding gp41 epitope)

Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Thr Thr Cys Ile Leu Lys Gly
Ser Cys Gly Trp Ile Gly Leu Leu Asp Asp Asp Asp Lys



SEQ ID NO: 25 (repeat domain in TAG spacer sequence)
Pro Ala Asp Asp Ala



SEQ ID NO: 26 (TAG spacer sequence)

Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp
Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp

SEQ ID NO: 27 (Vit pro/Vit targ/TAG/pro-insulin/synthetic polyA)
TGAATGTGTT CTTCTGTTAT CAATATAAAT CACAGTTAGT GATGAAGTTG GCTGCAAGCC
TGCATCAGTT CAGCTACTTG GCTGCATTTT GTATTTGGTT CTGTAGGAAA TGCAAAAGGT
TCTAGGCTGA CCTGCACTTC TATCCCTCTT GCCTTACTGC TGAGAATCTC TGCAGGTTTT
AATTGTTCAC ATTTTGCTCC CATTTACTTT GGAAGATAAA ATATTTACAG AATGCTTATG
AAAGCTTTGT TCATTTAAAA ATATTCCTGG TCAGCGTGAC CGGAGCTGAA AGAACACATT
GATCCCGTGA TTTCAATAAA TACATATGTT CCATATATTG TTTCTCAGTA GCGTCTTAAA
TCATGTGCGT TGGTGCACAT ATGAATACAT GAATAGCAAA GGTTTATCTG GATTACGCTC
TGGCCTGCAG GAATGGCCAT AAACCAAAGC TGAGGGAAGA GGGAGAGTAT AGTCAATGTA
GATTATACTG ATTGCTGATT GGGTTATTAT CAGCTAGATA ACAACTTGGG TCAGGTGCCA
GGTCAACATA ACCTGGGCAA AACCAGTCTC ATCTGTGGCA GGACCATGTA CCAGCAGCCA
GCCGTGACCC AATCTAGGAA AGCAAGTAGC ACATCAATTT TAAATTTATT GTAAATGCCG
TAGTAGAAGT GTTTTACTGT GATACATTGA AACTTCTGGT CAATCAGAAA AAGGTTTTTT
ATCAGAGATG CCAAGGTATT ATTTGATTTT CTTTATTCGC CGTGAAGAGA ATTTATGATT
GCAAAAAGAG GAGTGTTTAC ATAAACTGAT AAAAAACTTG AGGAATTCAG CAGAAAACAG
CCACGTGTTC CTGAACATTC TTCGATAAAA GTCTGACCAT GCCTGGCAGA GCCCTATTCA
CCTTCGCTAT GAGGGGGATC ATACTGGCAT TAGTGCTCAC CCTTGTAGGC AGCCAGAAGT
TTGACATTGG TAGACTGAGA ATGGCAAGAA GAATGAGAAGA TGGTTTGTG AACCAACACC
TGTGCGGCTCA CACCTGGTGG AAGCTCTCTA CCTAGTGTGCG GGGAACGAGG CTTCTTCTAC
ACACCCAAGA CCCGCCGGGA GGCAGAGGAC CTGCAGGTGGG GCAGGTGGAG CTGGGCGGGG
GCCCTGGTGC AGGCAGCCTG CAGCCCTTGG CCCTGGAGGGG TCCCTGCAGA AGCGTGGCAT
TGTGGAACAA TGCTGTACCA GCATCTGCTC CCTCTACCAGC TGGAGAACTA CTGCAACTAG
GGCGCCTGGATCCAGATCACTTCTGGCTAATAAAAGATCAGAGCTCTAGAGATCTGTGTGTTGGTTTTT
TGTGGATCTGCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACC
CTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGG
TGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGCACAGCAAGGGGGAGGATTGGGAAGACAATAGCAGG
CATGCTGGGGATGCGGTGGGCTCTATGGGTACCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC
TCTCGGTACGTCTCTC

50 

SEQ ID NO: 28 (synthetic polyA sequence)

GGCGCCTGGATCCAGATCACTTCTGGCTAATAAAAGATCAGAGCTCTAGAGATCTGTGTGTTGGTTTTT
TGTGGATCTGCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACC
CTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGG
TGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGCACAGCAAGGGGGAGGATTGGGAAGACAATAGCAGG
CATGCTGGGGATGCGGTGGGCTCTATGGGTACCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC
TCTCGGTACGTCTCTC



SEQ ID NO: 29 (pTnMod (Oval/ENT tag/p146/PA) - Chicken)

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 50
CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100
TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCATCAGA TTGGCTATTG 150
GCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200
CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 250
TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 300
CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 350
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 400
GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 450
CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG 500
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 550
TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 600
CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 650
ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 700
TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 750
CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC 850
TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 900
GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GACGTAAGTA 950
CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000
CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTTCCTTAT GCTATAGGTG 1050
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100
CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC ATGGCTCTTT 1150
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 1200
TGACACGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 1250
AATTCACATA TACAACAACG CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300
CATAGCGTGG GATCTCCACG CGAATCTCGG GTACGTGTTC CGGACATGGG 1350
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCCT GGTCCCATGC 1400
CTCCAGCGGC TCATGGTCGC TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 1450
CCAGACTTAG GCACAGCACA ATGCCCACCA CCACCAGTGT GCCGCACAAG 1500
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550
CACGGCTGAC GCAGATGGAA GACTTAAGGC AGCGGCAGAA GAAGATGCAG 1600
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650
GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC 1700
CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750
CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 1800
TTACATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT TAAAACGACT 1850
CAACAGCTTA ACGTTGGCTT GCCACGCATT ACTTGACTGT AAAACTCTCA 1900
CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGCGAG AACAAAACAT 1950
AACATCAAAC GAATCGACCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2000
GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG CAAAAACGAC 2100
TTATGGTATT GCGAGCTTCA GTCGCACTAC ACGGTCGTTC TGTTACTCTT 2150
TATGAGAAAG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 2200
CCAATTTCTA GCCGACCTTG CGAGCATTCT ACCGAGTAAC ACCACACCGC 2250
TCATTGTCAG TGATGCTGGC TTTAAAGTGC CATGGTATAA ATCCGTTGAG 2300
AAGCTGGGTT GGTACTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2350
AGACCTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CATGATATGT 2400
CATCTAGTCA CTCAAAGACT TTAGGCTATA AGAGGCTGAC TAAAAGCAAT 2450
CCAATCTCAT GCCAAATTCT ATTGTATAAA TCTCGCTCTA AAGGCCGAAA 2500
AAATCAGCGC TCGACACGGA CTCATTGTCA CCACCCGTCA CCTAAAATCT 2550
ACTCAGCGTC GGCAAAGGAG CCATGGGTTC TAGCAACTAA CTTACCTGTT 2600
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2650
GATTGAAGAA ACCTTCCGAG ACTTGAAAAG TCCTGCCTAC GGACTAGGCC 2700
TACGCCATAG CCGAACGAGC AGCTCAGAGC GTTTTGATAT CATGCTGCTA 2750
ATCGCCCTGA TGCTTCAACT AACATGTTGG CTTGCGGGCG TTCATGCTCA 2800
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATCGAA 2850
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2900
TACACAATAA
AAATTTATTC 
GATCACTTCT 
TTTTTTGTGG 
5 CCCTCCCCCG 
TTCCTAATAA 
CTATTCTGGG 
GACAATAGCA 
CTCTCTCTCT 

1 0 CTCTCTCTCT 
TTGACCCGGT 
CAATTACTCA 
CATCACAACA 
TATTTGAACA 

1 5 CCTATCCAAG 
TTAGGCTTGA 
ATCCAAGAGA 
GACCCTGAGC 
TTAAGGTGGA 

20 CACTCATTAG 
TGTGTGGAAT 
CATGATTACG 
GOAGCTCCAC 
AGGTCAGAAT 

25 AACAATAGCT 
TCCCTCGAAC 
ATCTGCCAGG 
AGTTCATATC 
GAGCTATGTT 

30 ACACCCATAA 
CGTCTGCTCT 
TTATTTCTCC 
TTATGTCCTG 
TCAAATGAAA 

35 ACTAACTAAT 
TTTCTGTTTT 
GAACATGAGC 
GATTAGCAGA 
TATTTGCTCT 

40 CCCAATCCCA 
00 AACCTGTG 
CGGATCCATG 
TATTCAAGGA 
CCCATTGCCA 

45 CAGCACCAGG 
GATTCGGAGA 
TCTTCACTTA 
TTCGTTCAGC 
TGCCAGAATA 

SO CCTATCAACT 
CTGGGTAGAA 

AAAGGACTGT 
TTTCAGAGTG 



55 TTGGTTTATT 
GAGCTTCCAT 
TGAAGTCTCA 
TGACTGAATG 
TACTTACCTC 

60 AATGGCTATG 
GCATCTCCTC 




CAAGGGAAGA 

ACACATGOTT 

GGC T AAT AAA 

ATCTGCTGTG 

TGCCTTGCTT 

AAT G AG G AAA 

GGGTGGGGTG 

GGCATGCTGG 

CTCTCTCTCT 

CTCTCTCTCT 

GACCAAAGGT 

GTGCCTGTTA 

AAAACTGATT 

TTATCTTGAT 

AAGTGATGCC 

ATACATTACT 

A C» C A ACT i A>A 

ACTGATGAAT 

TACACATCTT 

GGACCCCAGG 

TGTGAGCGGA 

CCAAGCGCGC 

CGCGGTGGCG 

GGTTTCTTTA 

TCTATAACTG 

CATGAACACT 

CCATTAAGTT 

ATAAACACAT 



CTTACTCGTG 

ACGCTTTGGG 

AGATCAGAGC 

C CT'TCT AG T T 

GACCCTGGAA 

TTGCATCGCA 

GGGCAGCAGA;: 

GGATGCGGTG 

CTCTCTCTCT 

CTCTCTCTCT 

GCCTTTTATC 

TAAGCAGCAA 

TAAGAAATGG 

rv 



Vv 



TTGCTCTAT 
AAAGATAGAT 
TCACTCTAGT 
TATTTTGTGA 

cctagcatgg 
cagacttctg 
aattgctaat 
cttaaagatc 
aatatttccc 
acaggcagaa 
ccattcaatc: 

TTAAATGATT 

GGTGGGTCAC 

GGCTCC&TGG 

GCTCAAAGTC 

TCATGTCAGC 

AC AC AG AT A A 

CAGTATTGAA 

GAGACATCCT 

CTTGCCAGTA 

CTTGCAGTGT 

TTCAAACAGC 

AGTCAGACAA 

TTCTCAAACT 

GGGAGAAAAC 

ACTGAGCAAG 

TAGAGTGGCA 

TTGCCAGTGG 

GGCCTTGAGC 

GACCAGTTCT: 

GCATGAAGAT 

GGCATTACTG 

AGCAGAGAGC 



TATCATTGGT 
GGTAAGGTAA 
AGCTTTCCTG: 
CCCCTAATGA 
GTCATATGAT 
CTTTACACTT 
TAACAATTYC 
AATTAACCCT 
GCCGCTCTAG 
CTGTTTGTCA 
AAATATATTT 
CCTCOAGCTG 
ATTCATGGAA 
TTGAAATTGA 
CTCAGAAAAA 



GCTGC AACCC 

GAAATTATGA 

TCTAGAGATC 

GCCAGCCATC 

GGTGCCACTC 

TTGTCTGAGT 

GCAAGGGGGA 

GGCTCTATGG 

CTCTCGGTAC 

CGGTACCAGG 

ATCACTTTAA 

TTAATTATGA 

TTGGTCTGCC 

ATAATAATAA 

TGGAATGAAC 

ACGCCATTGT 

ACGGAATGTT 

TTTTGGTAA& 

CCCGGTAATG 

TATGCTTCCG 



CTCAGATGCA 

GTCTGTTACT 

TATGTTTTCC 

CGATTATCTG 

AGTCTTCTCT 

AACACATTGT 

CAAAATGGAC 

TCTATGGCGT 

AATTCAGGCT 

GCGCAGCAAG 

CACCATGCCA 

TCTAGCCATG 

ATAAGGTTGT 

GCTCAGTGTG 

CAACCAAATC 

GACTTTATGC 

GTGAAGGAAC 

TGGAGATCAA 

ATGGAATTAT 

GCAATGGTTC 

ATTTAAGGAT 

AAAGCAAACC 

TCAATGGCTT 

GAGAATGAGC 

AGCTTGAGAG 

AATGTTATGG 

GGAGGAAAAA 

ACGTGTTTAG 

CTGAAGATAT 



CACTAAAGGG 
AACTAGTGGA 
ATTCTATTAT 
GCTATTGTAT 

GATCTTTGAG 

GTATTGTTTT 

AAGTTTGTTA 

CAGCTATAGG 

TCCTTCACAT 

AGGTCACGTC 

CGTTGTAGAT 

ACAACCATAG 

ATCTCTAAGG 

GTTGTAACTG 

eCGATCCAAC 

TACCCAGAAT 

CTATTGAAAG 

CAAAGGTCAA 

ATATATTCCC 

CATGG AftTTT 

ATGAGAACAT 

wlAi ACC I GG 

TCGCTTTGAT 

GCACATCTGT 

ACCAAACCAA 

TGAAGAGAGA 

TGTATAGAGG 

GCCA0AGAGC 

CAGAAATGTC 

TGGTTAATGC 

GAAGACACAC 

TGTGCAGATG 

CTGAGAAAAT 

ATGTTGGTGC 

T AT A AT C A AC 

AAGAGAGGAA 
TACAACCTCA 
CTCTTCAGCC 
CTCAAGCTGT 
TACTAGCTGA
TAATGATCCA 
TGT GTGTTGG 
TGTTGTTTGC 
CCACTGTCCT 
AGGTGTCATT 
GGATTGGGAA 
GTACCTCTCT 
CTCTCTCTCT 
TGCTGAAGAA 
AAATAAAAAA 
TTGATGCCTA 
TTAGAAAGTA 
AAACCTTATC 
TTGAAAAAAA 
CAGCAAATTG 
AATTCTCGTT 
AATCATTAAG 
TGAGTTAGCT 
GCTCGTATGT 
CAGCTATGAC 
AACAAAAGCT 
TCCCCCGGGG 
TTCAATACAG 
ATTATGATTG 
TTCCTCTGTC 
GAACACTGCA 
GCATTGTATG 
TAAAGCATTC 
AAAGAAAGTG 
GCATGCTTCT 
TTGTTCTCAC 
ACAAGAAGGA 
TAATAAGCAC 
TTCCCACATT 
AAGCTCAATG 
AGTCCTGATG 
TAAAAACTAA 
TAAAATCTAA 
ACTTCTGAAG 
CAGGGCTCAG 
TGTTTTGATG 
CTTCTACTGC 
GTGCAAAAGA 
AAACTTCCAG 
AAACGTTCAC 
ATGATGTTTA 

TACGCAATCC 
AGGCTTGGAA 

TCATCAATTC 
CTTCAGCCAA 
CATTGTCTTC 
AAGCAATGCC 
ATGTACCAGA 
GAAGATCCTG 
TGTTGCCTGA 
TTTGAAAAAC 
GATCAAAGTG 
CATCTGTCTT 
AATCTGTCTG 
CCATGCAGCA 



2 950 
3000 
30S0 
3100 
3150 
3200 
3250 



3300 
33SG 
3400 
3450 
3500 
3550 
36 00 

3 700 
3750 
3800 
3850 
3900 
3950 
4000 
4050 
4100 
4150 
4200 
42S0 
43C0 
4350 
44 00 
4450 
4500 

4 550 

4 6 00 
4650 
4700 
4750 
4800 
4S50 
4900 
4950 

5 000 
5050 
5100 
5150 
5200 
5250 
5300 
5350 
5400 
5450 
55 00 
5550 
5S00 
5S50 
5 7 00 
5 750 
5800 
5S50 
5900 
5 950 
CATGCAGAAA TCAATGAAGC AGGCAGAGAG GTGGTAGGGT CAGCAGAGGC 6000
TGGAGTGGAT GCTGCAAGCG TCTCTGAAGA ATTTAGGGCT GACCATCCAT 6050
TCCTCTTCTG TATCAAGCAC ATCGCAACCA ACGCCGTTCT CTTCTTTGGC 6100
AGATGTGTTT CCCCTCCGCG GCCAGCAGAT GACGCACCAG CAGATGACGC 6150
ACCAGCAGAT GACGCACCAG CAGATGACGC ACCAGCAGAT GACGCACCAG 6200
CAGATGACGC AACAACATGT ATCCTGAAAG GCTCTTGTGG CTGGATCGGC 6250
CTGCTGGATG ACGATGACAA AAAATACAAA AAAGCACTGA AAAAACTGGC 6300
AAAACTGCTG TAATGAGGGC GCCTGGATCC AGATCACTTC TGGCTAATAA 6350
AAGATCAGAG CTCTAGAGAT CTGTGTGTTG GTTTTTTGTG GATCTGCTGT 6400
GCCTTCTAGT TGCCAGCCAT CTGTTGTTTG CCCCTCCCCC GTGCCTTCCT 6450
TGACCCTGGA AGGTGCCACT CCCACTGTCC TTTCCTAATA AAATGAGGAA 6500
ATTGCATCGC ATTGTCTGAG TAGGTGTCAT TCTATTCTGG GGGGTGGGGT 6550
GGGGCAGCAC AGCAAGGGGG AGGATTGGGA AGACAATAGC AGGCATGCTG 6600
GGGATGCGGT GGGCTCTATG GGTACCTCTC TCTCTCTCTC TCTCTCTCTC 6650
TCTCTCTCTC TCTCTCGGTA CCTCTCTCGA GGGGGGGCCC GGTACCCAAT 6700
TCGCCCTATA GTGAGTCGTA TTACGCGCGC TCACTGGCCG TCGTTTTACA 6750
ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT CGCCTTGCAG 6800
CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 6850
CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGGA AATTGTAAGC 6900
GTTAATATTT TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT 6950
TTTTAACCAA TAGGCCGAAA TCGGCAAAAT CCCTTATAAA TCAAAAGAAT 7000
AGACCGAGAT AGGGTTGAGT GTTGTTCCAG TTTGGAACAA GAGTCCACTA 7050
TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG TCTATCAGGG 7100
CGATGGCCCA CTACTCCGGG ATCATATGAC AAGATGTGTA TCCACCTTAA 7150
CTTAATGATT TTTACCAAAA TCATTAGGGG ATTCATCAGT GCTCAGGGTC 7200
AACGAGAATT AACATTCCGT CAGGAAAGCT TATGATGATG ATGTGCTTAA 7250
AAACTTACTC AATGGCTGGT TATGCATATC GCAATACATG CGAAAAACCT 7300
AAAAGAGCTT GCCGATAAAA AAGGCCAATT TATTGCTATT TACCGCGGCT 7350
TTTTATTGAG CTTGAAAGAT AAATAAAATA GATAGGTTTT ATTTGAAGCT 7400
AAATCTTCTT TATCGTAAAA AATGCCCTCT TGGGTTATCA AGAGGGTCAT 7450
TATATTTCGC GGAATAACAT CATTTGGTGA CGAAATAACT AAGCACTTGT 7500
CTCCTGTTTA CTCCCCTGAG CTTGAGGGGT TAACATGAAG GTCATCGATA 7550
GCAGGATAAT AATACAGTAA AACGCTAAAC CAATAATCCA AATCCAGCCA 7600
TCCCAAATTG GTAGTGAATG ATTATAAATA ACAGCAAACA GTAATGGGCC 7650
AATAACACCG GTTGCATTGG TAAGGCTCAC CAATAATCCC TGTAAAGCAC 7700
CTTGCTGATG ACTCTTTGTT TGGATAGACA TCACTCCCTG TAATGCAGGT 7750
AAAGCGATCC CACCACCAGC CAATAAAATT AAAACAGGGA AAACTAACCA 7800
ACCTTCAGAT ATAAACGCTA AAAAGGCAAA TGCACTACTA TCTGCAATAA 7850
ATCCGAGCAG TACTGCCGTT TTTTCGCCCC ATTTAGTGGC TATTCTTCCT 7900
GCCACAAAGG CTTGGAATAC TGAGTGTAAA AGACCAAGAC CCGCTAATGA 7950
AAAGCCAACC ATCATGCTAT TCCATCCAAA ACGATTTTCG GTAAATAGCA 8000
CCCACACCGT TGCGGGAATT TGGCCTATCA ATTGCGCTGA AAAATAAATA 8050
ATCAACAAAA TGGCATCGTT TTAAATAAAG TGATGTATAC CGAATTCAGC 8100
TTTTGTTCCC TTTAGTGAGG GTTAATTGCG CGCTTGGCGT AATCATGGTC 8150
ATAGCTGTTT CCTGTGTGAA ATTGTTATCC GCTCACAATT CCACACAACA 8200
TACGAGCCGG AAGCATAAAG TGTAAAGCCT GGGGTGCCTA ATGAGTGAGC 8250
TAACTCACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC AGTCGGGAAA 8300
CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG 8350
GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC 8400
TCGGTCGTTC GGCTGCGGCG AGCGGTATCA GCTCACTCAA AGGCGGTAAT 8450
ACGGTTATCC ACAGAATCAG GGGATAACGC AGGAAAGAAC ATGTGAGCAA 8500
AAGGCCAGCA AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT 8550
TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC GACGCTCAAG 8600
TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC 8650
CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA 8700
TACCTGTCCG CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCATAGCTC 8750
ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT 8800
GTGTGCACGA ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC 8850
TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC CACTGGCAGC 8900
AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG 8950
AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT 9000
GGTATCTGCG CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG 9050
CTCTTGATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT TTTTTTGTTT 9100
GCAAGCAGCA GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG 9150
ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT CACGTTAAGG 9200
GATTTTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA 9250
ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG 9300
TCTGACAGTT ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG 9350
TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG TAGATAACTA 9400
CGATACGGGA GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA 9450
GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC AGCCAGCCGG 9500
AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT 9550
CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT 9600
TTGCGCAACG TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC 9650
GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA 9700
CATGATCCCC CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG 9750
ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC 9800
AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG 9850
TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA 9900
CCGAGTTGCT CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG 9950
CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC 10000
TCTCAAGGAT CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT 10050
GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG TTTCTGGGTG 10100
AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC 10150
GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT 10200
TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA 10250
AAATAAACAA ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCAC 10297



SEQ ID NO: 30 (pTnMod (Oval/ENT tag/p146/PA) - Quail)

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 50
CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100
TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCATCAGA TTGGCTATTG 150
GCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200
CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 250
TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 300
CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 350
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 400
GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 450
CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG 500
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 550
TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 600
CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 650
ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 700
TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 750
CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC 850
TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 900
GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GACGTAAGTA 950
CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000
CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTTCCTTAT GCTATAGGTG 1050
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100
CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC ATGGCTCTTT 1150
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 1200
TGACACGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 1250
AATTCACATA TACAACAACG CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300
CATAGCGTGG GATCTCCACG CGAATCTCGG GTACGTGTTC CGGACATGGG 1350
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCCT GGTCCCATGC 1400
CTCCAGCGGC TCATGGTCGC TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 1450
CCAGACTTAG GCACAGCACA ATGCCCACCA CCACCAGTGT GCCGCACAAG 1500
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550
CACGGCTGAC GCAGATGGAA GACTTAAGGC AGCGGCAGAA GAAGATGCAG 1600
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650
GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC 1700
CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750
CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 1800
TTACATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT TAAAACGACT 1850
CAACAGCTTA ACGTTGGCTT GCCACGCATT ACTTGACTGT AAAACTCTCA 1900
CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGCGAG AACAAAACAT 1950
AACATCAAAC GAATCGACCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2000
GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG CAAAAACGAC 2100
TTATGGTATT GCGAGCTTCA GTCGCACTAC ACGGTCGTTC TGTTACTCTT 2150
TATGAGAAAG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 2200
CCAATTTCTA GCCGACCTTG CGAGCATTCT ACCGAGTAAC ACCACACCGC 2250
TCATTGTCAG TGATGCTGGC TTTAAAGTGC CATGGTATAA ATCCGTTGAG 2300
AAGCTGGGTT GGTACTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2350
AGACCTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CATGATATGT 2400
CATCTAGTCA CTCAAAGACT TTAGGCTATA AGAGGCTGAC TAAAAGCAAT 2450
CCAATCTCAT GCCAAATTCT ATTGTATAAA TCTCGCTCTA AAGGCCGAAA 2500
AAATCAGCGC TCGACACGGA CTCATTGTCA CCACCCGTCA CCTAAAATCT 2550
ACTCAGCGTC GGCAAAGGAG CCATGGGTTC TAGCAACTAA CTTACCTGTT 2600
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2650
GATTGAAGAA ACCTTCCGAG ACTTGAAAAG TCCTGCCTAC GGACTAGGCC 2700
TACGCCATAG CCGAACGAGC AGCTCAGAGC GTTTTGATAT CATGCTGCTA 2750
ATCGCCCTGA TGCTTCAACT AACATGTTGG CTTGCGGGCG TTCATGCTCA 2800
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATCGAA 2850
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2900
TACACAATAA CAAGGGAAGA CTTACTCGTG GCTGCAACCC TACTAGCTCA 2950
AAATTTATTC ACACATGGTT ACGCTTTGGG GAAATTATGA TAATGATCCA 3000
GATCACTTCT GGCTAATAAA AGATCAGAGC TCTAGAGATC TGTGTGTTGG 3050
TTTTTTGTGG ATCTGCTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 3100
CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT 3150
TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3200
CTATTCTGGG GGGTGGGGTG GGGCAGCACA GCAAGGGGGA GGATTGGGAA 3250
GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTCTATGG GTACCTCTCT 3300
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCGGTAC CTCTCTCTCT 3350
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 3400
TTGACCCGGT GACCAAAGGT GCGTTTTATC ATCACTTTAA AAATAAAAAA 3450
CAATTACTGA GTGCCTGTTA TAAGCAGCAA TTAATTATGA TTGATGCCTA 3500
CATCACAACA AAAACTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3600
CCTATCCAAG AAGTGATGCC TATCATTGGT TGGAATGAAC TTGAAAAAAA 3650
TTAGCCTTGA ATACATTACT GGTAAGGTAA ACGCCATTGT CAGCAAATTG 3700
ATCCAAGAGA ACCAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3750
GACCCTGAGC ACTGATGAAT CCCCTAATGA TTTTGGTAAA AATCATTAAG 3800
TTAAGGTGGA TACACATCTT GTCATATGAT CCCGGTAATG TGAGTTAGCT 3850
CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT 3900
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 3950
CATGATTACG CCAAGCGCGC AATTAACCCT CACTAAAGGG AACAAAAGCT 4000
GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGG 4050
AGGTCAGAAT GGTTTCTTTA CTGTTTGTCA ATTCTATTAT TTCAATACAG 4100
AACAAAAGGT TCTATAACTG AAATATATTT GCTATTGTAT ATTATGATTG 4150
TCCCTCGAAC CATGAACACT CCTCCAGCTG AATTTCACAA TTCCTCTGTC 4200
ATCTGCCAGG CTGGAAGATC ATGGAAGATC TCTGAGGAAC ATTGCAAGTT 4250
CATACCATAA ACTCATTTGG AATTGAGTAT TATTTTGGTT TGAATGGAGC 4300
TATGTTTTGC AGTTCCCTCA GAAGAAAAGC TTGTTATAAA GCGTCTACAC 4350
CCATCAAAAG ATATATTTAA ATATTCCAAC TACAGAAAGA TTTTGTCTGC 4400
TCTTCACTCT GATCTCAGTT GGTTTCTTCA CGTACATGCT TCTTTATTTG 4450
CCTATTTTGT CAAGAAAATA ATAGGTCAAG TCCTGTTCTC ACTTATCTCC 4500
TGCCTAGCAT GGCTTAGATG CACGTTGTAC ATTCAAGAAG GATGAAATGA 4550
AACAGACTTC TGGTCTGTTA CAAGAACCAT AGTAATAAAC AGACTAACTA 4600
AT AATTG CT A 
TTA&GATCCC 
TATTTCTCAG 
GAAAACACTT 
ATCCAAAATG 
ATTTCTATGG 
CCCAATTCAG 
TCCATCGGTG 
CAAAGTCCAC 
TGTCAACTCT 
CAGATAAATA 
TATTGAAGCT 
ACAT&GTCAA 
G C C AG TAG AO 
GCAATGTGTG 
&AACAGCTGC 
CAGACAAACG 
TCAAACTGCA 
AGAAAGCATT 
GAGCAAGAA& 
AG TGG CATC A 
CCAGTGGAAC 
w J. i.GAvsuAtfG 
CAGTTCTAGT 
TGAAOATGGA 
ATT&CTGACC 
AGGGAGCCTG 
ATGAAGCGGG 
ACTGAAGAAT 
CGAAACCAAC 
CAGATGACGC 
GACGCACCAG 
GAAAGGCTCT 
AC AAAAAAG C 
GATCCAGATC 
TGXTGGTTTT 
GTTTGCCCCT 
TGTCCTTTCC 
GTCATTCTAT 
TGGGAAGACA 
CTCTCTCTCT 

CGCGCTCACT 
GTTACCCAAC 
TAATAGCGAA 
TGAATGGCGA 
TA&ATTTTTG 
AAAATCCCTT 
TCCAGTTTGG 
AAGGGCGAAA 
ATGACAAGAT 
AGGGGATTCA 

«Avj JU s A i vj A 
ATATCGCAAT 
CAATTTATTG 
AAATAGATAG 

GGTGACGAAA 
GGGGTTAACA 
-i J^v^Ai^«- \y AiAi i A 
AAATAACAGC 




ATTATGTTTT 

ATTATCTGGT 

TCTTTTCTCC 

TGTTACCCAG 

GACCTATTGA 

CGTCAAAGGT 

GCTATATATT 

CAGCAAGCAT 

CATGCCAATG 

GGCCATGGTC 

AGGTTGTTCA 

CAGTGTGGCA 

CCAAATCACC 

TTTAT GCTCA 

AAGGAACTGT 

AGATCAAGCC 

GAATTATCAG 

ATGGTCCTGG 

TAAGGCTGAA 

GCAAACCTGT 

ATGGCTTCTG 

AATGAGCATG 

TTGAGAGTAT 

ATTATGGAAG 

GGAGAAATAC 

TGTTCAGCTC 

AAGAT&TCTC 

CAGAGATGTG 

TTAGGGCTGA 

GCCATTCTCC 

ACCAGCAGAT 

CAGATGACGC 

TGTGGCTGGA 

ACTGAA&AA& 

ACTTCXGGCT 

TTGTGGATCT 

CCCCCGTGCC 

TA&TAAAATG 

TCTGGGGGGT. 

ATAGCAGGCA 

CTCTCTCTCT 

GG C C CGGT AC 

GGCCGTCGTT 

TTAATCGCCT 

GAGGCCCGCA 

ATGGAAATTG 

TT&AATCaGC 

AT AAAT C AAA 

AACAAGAGTC 

AACCGTCTAT 

GTGTATCCAC 

TCAGTGCTCA 

TGATGATGTG 

ACATGCGAAA 

CTATTTACCG 

QTTTTATTTG 

TATCAAGAGG 

TAACTAAGCA 

TGAAGGTCAT 

ATCCAAATCC 

AAACAGTAAT 



CCATCTCTAA 
TGTAACTGAA 
AGCJUTCCTG 

aattaaaaac 

AACTAAAATC 
CAAACTTTXG 
CCCCAGGGCT 
GGAATTTTGT 
ACAACATGCT 
TTCCTAGGTG 
CTTTGATAAA 
CATC TGTAA A 



GGTTCCCACA 
GCTCAATGGA 
ACGGATTAGA 
TAATATTTGC 
TGAGCCAATC 
AAGGGAACCT 
CAGCCAGTGG 
TTTGATGT AT 
CTACTCCCCC 
CAAAAGACAG 
CTTC C AGO AT 
TGTXCACTCT 



AGAGACATAC 



AGAGGCCTCA 
AAACATCCTT 
TTAATGCCAT 
GACACGCAAA 
GCAGATGATG 
AGAAAATGAA 



^1.^!^ x'TT 
AGAGGAAGGT 
AACCTCACAT 
XT C AG CC A AT 
AA0CTGTCCA 
GTAGGCTCAC 
CCATCCATXC 
TCTTTGGCAG 
GACGCACCAG 
ACCAGCAGAT 
TCGGCCTGCT 
GTGGC AAAAC 
AATAAAAGAT 
GCTGTGCCTT 
TTCCTTGACC 
&GGAAATTGC 
GG'GGTG.GGGC 
TGCTGGGGAT 
TCTCTCT 



CCAATTCGCC 
TTACAACGTC 
TGCAGCACAT 
CCGATCGCCC 
TAAGCGTTAA 
TCATTTTTTA 
AGAATAGACC 
CACTATTAAA 
CAGGGCGATG 
CTTAACTTAA 
GGGTC&ACGA 
CTTAAAAACT 
AACCTAAAAG 
CGGCTTTTTA 
A AG CT AAATC 
GTCATTATAT 
CTTGTCTCCT 
CGATAGCAGG 
AGCCATCCCA 
GGGCCAATAA 



ACAGTCGTGC 

CTTAGAATCC 

TCAATGCCTG 

CAGCCAAGCT 

TGCCTTCAAG 

CAATACCTTT 

TACCAGATTG 

GATCCTGGAG 

TGCCTGATGA 

GAAAAACTGA 

CAAAGXGTAC 

CTCTCTTAAT 

CTGT GTGGC A 

TGCAGCACAT 

C AGAG GCTGG 

CTCTTCTGTG 

ATGTGTTTCT 

CAGATGACGC 

GACGCAACAA 

GGATGACGAT 

TGCTGTAATG 

CAGAGCTCTA 

CTAGTTGCCA 

CTGGAAGGTG 

ATCGCATTGT 

AGCACAGCAA 

GCGGTGGGCT 

CTCTCTCTCT 

CTATAGTGAG 

GTGACTGGGA 

CCCCCTfcTCG 

TXCCCAACAG 

TATTTTGTTA 

ACCAATAGGC 

GAGATAGGGT 

GAACGTGGAC 

GCCCACTACT 

TGATTXTTAC 

GAftTTAACAT 

TACT C&&TGG 

AGCTTGCCGA 

TTGAGCTTGA 

TTCTTXATCG 

TTCGCGGAAT 

GTTTACTCCG 

AT AAT AAT A C 

AATTGGTAGT 

CACCGGTTGC 
TTTTTCTGTT
ACATGAACAG 
AGAACXGGCA 
TCXCCCTTCA 
CCAsTTAAATT 
GTGGGTGGGT 
ATCCATGGGC 
TCAAGGAGCT 
TTTGCCATCT 

TCGGAGACAG 

TCACTl'AGAG 

GTTCAGCCTT 

CGGAATACTT 

GTCAACTTTC 

GGTAGAAAGT 

CCGTGGATTC 

GGACTGTGGG 

CAGAGTGACT 

GTTCATTTAA 

CTTCGATXTG 

TGXCTCAGGC 

CTGAATGGAC 

TTACCTCGCA 

GGCTATGGGA 

TCTCCTCAGT 

GCAGAAATCA 

AGTGGATGCT 

TCAAGCACAT 

CCGCGGCCAG 

ACCAGCAGAT 

CATGTATCCT 

G A C A A AAA AT 

AGGGCGCCTG 

GAGATCTGTG 

GCCAXCTGTT 

CCACTCCCAC 

CTGAGTAGGT 

GGGGGAGGAT 

GTATGGGTAC 

CGGTACCTCT 

TCGTATTACG 

AAACCCTGGC 

CCAGCTGGCG 

TTGCGCAGCC 

AAATTCGCGT 

CGAAATCGGC 

TGAGTGTTGT 

TCCAACGTCA 

CCGGGATCAT 

C AA AAT C ATT 

TCCGTCAGGA 

CTGGTTATGC 

TAAAAAAGGC 

AAG AT A AAT A 

TAAAAAATGC 

CTGAGCTTGA 
AGT^AAACGC 
GAATGATTAT 
ATTGGTAAGG 



4650 
4 700 
4 750 
4800 

4 850 
4900 
4950 
5000 

5 050 
5100 
5150 
S2 00 
5250 
5300 
5350 
5400 
5450 
5500 
5550 
5600 
5650 
5700 
5750 
5800 
585 0 
5 900 
55S0 
6000 
6050 
6100 
6150 
6200 
6250 
6300 
6350 
6400 
64S0 
650 0 
655 0 
6600 
665 0 
6700 
6750 
6800 
6850 
6900 
6550 
7000 
7050 
7100 
7150 
7200 
7250 
7300 
7350 
74 00 
7450 
7S00 
7550 
760 0 
7650 
CTCACCAATA ATCCCTGTAA AGCACCTTGC TGATGACTCT TTGTTTGGAT 7700
AGACATCACT CCCTGTAATG CAGGTAAAGC GATCCCACCA CCAGCCAATA 7750
AAATTAAAAC AGGGAAAACT AACCAACCTT CAGATATAAA CGCTAAAAAG 7800
GCAAATGCAC TACTATCTGC AATAAATCCG AGCAGTACTG CCGTTTTTTC 7850
GCCCCATTTA GTGGCTATTC TTCCTGCCAC AAAGGCTTGG AATACTGAGT 7900
GTAAAAGACC AAGACCCGCT AATGAAAAGC CAACCATCAT GCTATTCCAT 7950
CCAAAACGAT TTTCGGTAAA TAGCACCCAC ACCGTTGCGG GAATTTGGCC 8000
TATCAATTGC GCTGAAAAAT AAATAATCAA CAAAATGGCA TCGTTTTAAA 8050
TAAAGTGATG TATACCGAAT TCAGCTTTTG TTCCCTTTAG TGAGGGTTAA 8100
TTGCGCGCTT GGCGTAATCA TGGTCATAGC TGTTTCCTGT GTGAAATTGT 8150
TATCCGCTCA CAATTCCACA CAACATACGA GCCGGAAGCA TAAAGTGTAA 8200
AGCCTGGGGT GCCTAATGAG TGAGCTAACT CACATTAATT GCGTTGCGCT 8250
CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT GCATTAATGA 8300
ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCTCTTCCGC 8350
TTCCTCGCTC ACTGACTCGC TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG 8400
TATCAGCTCA CTCAAAGGCG GTAATACGGT TATCCACAGA ATCAGGGGAT 8450
AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG 8500
TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 8550
AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA 8600
CTATAAAGAT ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC 8650
TGTTCCGACC CTGCCGCTTA CCGGATACCT GTCCGCCTTT CTCCCTTCGG 8700
GAAGCGTGGC GCTTTCTCAT AGCTCACGCT GTAGGTATCT CAGTTCGGTG 8750
TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 8800
CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 8850
GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA 8900
GCGAGGTATG TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA 8950
CGGCTACACT AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAG 9000
TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCAA ACAAACCACC 9050
GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 9100
AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 9150
AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA 9200
AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT 9250
CTAAAGTATA TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA 9300
GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC 9350
TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG 9400
CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT 9450
TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT 9500
GCAACTTTAT CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG 9550
AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG CAACGTTGTT GCCATTGCTA 9600
CAGGCATCGT GGTGTCACGC TCGTCGTTTG GTATGGCTTC ATTCAGCTCC 9650
GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA 9700
AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG 9750
CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC 9800
ATGCCATCCG TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC 9850
ATTCTGAGAA TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA 9900
TACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAAGT GCTCATCATT 9950
GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG 10000
ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT 10050
TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC 10100
GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT 10150
CCTTTTTCAA TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG 10200
GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC 10250
ACATTTCCCC GAAAAGTGCC AC 10272


SEQ ID NO: 31 (pTnMod (Oval/ENT tag/Proins/PA) - Chicken)

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 50
CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100
TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCATCAGA TTGGCTATTG 150
GCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200
CATGTCCAAC 
TAGTAATCAA 
CGTTAC&TAA 
CCCGCCCATT 
GGGAGTTTCC 
CTTGGCAGTA 



AATGACGG 
TGGGACTXTC 
CATGGTGATG 
ACTCA CGGGG 
TTTT'GGCACC 
CCCATTGACG 
GCAGAGCTCG 
TGTTTTGACC 
GGAACGGTGC 
CCGCCTATAG 
CTGTTTTTGG 
ATGGTATAGC 
CCCCTATTGG 
GCCACAACTA 
TGACACGGAC 
AATTCACATA 
CATAGCGTGG 
CTCTTCTCCG 
CTCCAGCGGC 
CCAGACTTAG 
GCCGTGGCGG 
CACGGCTGAC 
GCAGCTGAGT 
GTGCTGTTAA 
CGCGCGCGCC 
CCATGGGTCT 
TTACATGATT 
CAACAGCTTA 
CTCTTACCGA 
AACATCAAAC 
GCGACTCGCT 
GATGCCCATT 
TTATGGTATT 
T A T G A G AAAG 
CCAAT.TTCTA 
TCATTGTCAG 
AAGCXGGGTT 
AGACCTAGGA 
CATCTAGTCA 
CCAATCTCAT 
AAATCAGCGC 

ACTCAGCGTC 
GAAATTCGAA 
GATTCftAGAA 
T A CG C CAT AG 
ATCGCCCTGA 
GAAACAAGGT 
ACGTACTCTC 
TACACAATAA 
AAATTTATTC 
GATCACTTCT 
TTTTTTGTGG 



ATT AC CGC CA 

TTACGGGGTC: 

CTTACGGTAA 

GACGT CAATA 

ATTGACGTCA: 

C AT C AAGTGT 

TAAATGGCCC 

CTACTTGGCA 
OGGTTTTGGC 
ATTTCCAAGT 
AAAATCAACG 
CAAATGGGCG 
TTTAGTGAAC 
TCCATAGAAG 
ATTGGAACGC 
ACT CT A T AG G 
CTTGGGGCCT 
TTAGCCTATA 
TGACGATACT 

TCTCT&TTGG 
T CTGT ATTTT 
TACAACAACG 
GATCTCCACG 
GTAGCGGCGG 
TCATGGTCGC 
GCACAGCACA 
TAGGGTATGT 
GCAGATGGAA 
TGTTGTATXC 
CGGTGGAGGG 
ACCAGACATA 



ACGTTGGCTT 
ACTXGGCCGT 
GAATCGACCG 
G T ATAC CGTT 
GTACTTGTTG 
GCGAGCTTCA 
CGTTCCCGCT 
GCCGAGCTTG 
TGATGCTGGC 
GGTACTGGTT 
GCGGAAAACT 
CXCAAAGACT 
GCCAAATTCT 
TCGACACGGA 
GGCAAAGGAG 
CACCCAAACA 
ACCTTGCGAG 
CCGAACGAGC 
TGCTTCAACT 
TGGGACAAGC 
AACAGTTCGC 
CAAGGGAAGA 
ACACATGOTT 
GGCTAATAAA 
ATCTGCTGTG 
TGCCTTCCTT 
TTCCTAATAA AATGAGGAAA 
CT ATTCTGGG GGGTGGGGTG 




TGTTGAC ATT 
ATTAGTTCAT 
ATGGGCCGCC 
ATGACGTATG 
ATGGGTGGAG: 
AX CAT ATGCG 
GCCTGGCATT 
GTAGATCTAC 
AGTACATCAA 
CtrCCAGGGCA: 
GGACTTTCCA 
GTAGGGGTGT 
GGTCAGATCG 
AGACCGGGAC 
GGATTCCGCG 
CACACCCGTT: 
AXACACCCCC.. 
GGTGTCGGTT 
TTCCATTAGX 
CTATATGGCA 
TACAGGATGG 
CCGTCCCCCG 
CGAATCTCGG 
AGGTTCCACA 
TGGGCAGCTC:: 
ATGGCGACGA 
GTCTGAAAAT 
GACXTAAGGC 
TGATAAGAGT 
CAGTGTAGTC 
ATAGGTGACA 
CACCOTCGG?y 



GATTATTGAG 
AGCeCATAm 
TGGCTGACCG 
TTCCCATAGT 
TAXTTACGGT 
AAGTACGCCC 
ATGCC CAGTA 
GTATTAGTCA 

TTGACGrCAft 
AA.ATGTCGTA 
ACGGTGGGAG 



TAGTTATTAA 250 
TGGAGTTCCG 3 00 
CCGAACGAGC 3 50 



CCCTCCCCCG



GCCACGCATX 
AAGGTGCCAA 
ATTGTTAGGT 
GG CATG CT AG 
ACTGGTCTGA 
GTCGCACTAC 
TTGAGAGCAA 
GGAGCATTCT 
TTTAAAGTGC 
AAGTCGAGTA 
GG AAACCTAT 
TTAGGCTATA 
ATTGTATAAA 
CXCATTGTC^ 
CCATGGGTTC 
ACTTGTXAAX 

ACTTGAAAAG 
AGCTCAGAGC 
AAC&TCTTGG 
ACTTCCAGGC 
TTAGGCATGG 
CTTACTCGTG 
ACGCTTTGGG 
AGATCAGAGC 
CCTTCTAGTT 
GACCCTGGAA 
rXGCATCGCA 
GGGCAGCACA 



CCTGGAGAO: 
CGATCCAGCC 
TGCCAAGAGT 
TGGCTCTTAT 
GCTTCCTTAT 
AXTGACCATT 
AATCCAXAAC 
ATACTCXGTC 
GGTCCCATTX 
TGCCCGCAGT 
GTACGTGTTC 
TCCGAGCCCT 
CTTGCTCCTA 
CGAGCAGTGT 
GAGCGTGGAG 
AGCGGCAGAA 
CAGAGGTAAC 
TGAGCAGTAC 
GACTAACAGA 
GCATGTGTGA 
(jAATTACACT 
ACTTGACTGX 
GCAAAGCG AG 
AATCGTCACC 
CTTTATCTGT 
TATTCGTGAG 
ACGGTCGTTC 
TGTTCAAAGA 
ACCGAGTAAC 
GATGGTATAA 
AGAGGAAAAG 
CAGCAACTXA 
A( y A GG CT G AC 
TCTCGCTCTA 
CCACCCGTCA 
TAGCAACTAA 
ATCTATXCGA 
TCCXGCCTAC 
GTTTTGATAT 
CTTGCGGGCG 
TAACACAGTC 
AAGXTTTGCG 
GCTGCAACCC 
GAAATTATGA 
TCTAGAGATC 
GCCAGCCATC 
GGTGCCACTG : 
TTGTCTGAGT 
GCAAGGGGGA 

12S 



AACGCCAAT 
AAACTGCCCA 


TCGCTATTAC 

TAGCGGTTTG 

TGGGAGXTTG 

ACAACTCCGC 

GTGTATATAA 

CCATCCACGC 

TCCGCGGCCG 

GACGTAAGTA 

GCATGCTATA 

GCTATAGGXG 

AXTGAC C ACT 

AXGGCTCTTT 

GTTCAGAGAC 

ATTATTTACA 

TTTTATTAAA 

CGGACATGGG 

GGTCCCATGC 

ACAGTGGAGG 

GCCGCACAAG 

ATTGGGGTGG 

GAAGATGCAG 

TCCCGTTGCG 

TCGXTGCTGC 

CTGTTCCTTT 

ACTTGATATT 

AACAAAACAT 
TCCACAAAGA 



^CGGGAATAC 



CAAAAACGAC 
TGTTACTCTT 
AAGCTCATGA 
ACCACACGGC 
ATCCGTTGAG 
TACAATATGC 
CATGATATGT 
'XAAAAGCAAT 
AAGGCCGAAA 
CCTAAAATCT 



ACCTGTT 
AGCGAATGCA 
GGACTAGGCC 
CATGCTGCTA 
TTCATGCTCA 
AG A AATCGAA 
GCATTCTGGC 
T A CT AG CT C A 
TAATGATCCA 
TGTGTGTTGG 
TGTTGTTTGC 
CCACTGTCCT 
AGGTGTCATT 
GGATXGGGAA 



4 00 
4 50 
500 
K 50 
600 
6"SQ 
700 
750 
800 
S50 
500 
950 
1000 
1050 
1100 
1X50 
1200 
1230 
1300 
1350 
1400 
14 50 
1500 
X530 
16 00 
1650 
1700 

a?so 

1800 
1830 
190C 
1950 
2000 
2050 
2100 
21S0 
2200 
2250 
2 3 00 
2350 
2400 
2450 
25 00 
255 0 
26Q0 
2650 
27 00 

2S00 

2 S 5 0 
2900 
2950 
3000 
3050 
3100 

3 ISO 
3200 
3250 



GACAATAGCA GGCATGCTGG GGATGCGGTG GGGTCTATGG GTACCTCTCT 3300
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCGGTAC CTCTCTCTCT 3350
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 3400
TTGACCCGGT GACCAAAGGT GCCTTTTATC ATCACTTTAA AAATAAAAAA 3450
CAATTACTCA GTGCCTGTTA TAAGCAGCAA TTAATTATGA TTGATGCCTA 3500
CATCACAACA AAAACTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3600
CCTATCCAAG AAGTGATGCC TATCATTGGT TGGAATGAAC TTGAAAAAAA 3650
TTAGCCTTGA ATACATTACT GGTAAGGTAA ACGCCATTGT CAGCAAATTG 3700
ATCCAAGAGA ACCAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3750
GACCCTGAGC ACTGATGAAT CCCCTAATGA TTTTGGTAAA AATCATTAAG 3800
TTAAGGTGGA TACACATCTT GTCATATGAT CCCGGTAATG TGAGTTAGCT 3850
CACTCATTAG GCACCCCAGG CTTTACAGTT TATGCTTCCG GCTCGTATCT 3900
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 3950
CATGATTACG CCAAGCGCGC AATTAACCCT CACTAAAGGG AACAAAAGCT 4000
GGAGCTCCAC GGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGG 4050
AGGTCAGAAT GGTTTCTTTA CTGTTTGTCA ATTCTATTAT TTCAATACAG 4100
AACAATACSCT TCTATAACTG AAATATATTT GCTATTGTAT ATTATGATTG 4150
TCCCTCGAAC CATGAACACT CCTCCAGCTG AATTTCACAA TTCCTCTGTC 4200
ATCTGCCAGG CCATTAAGTT ATTCATGOAA GATCTTTGAG GAACACTGCA 4250
AGTTCATATC ATAAACACAT TTGAAATTGA GTATTGTTTT GCATTGTATG 4300
GAGCTATGTT TTGCTGTATC CTCAGAAAAA AAGTTTGTTA TAAAGCATTC 4350
ACACCCATAA AAAGATAGAT TTAAATATTC CAGCTATAGG AAAGAAAGTG 4400
CGTCTGCTCT TCACTCTAGT CTCAGTTGGC TCCTTCACAT GCATGCTTCT 4450
TTATTTCTCC TATTTTGTCA AGAAAATAAT AGGTCACGTC TTGTTCTCAC 4500
TTATGTCCTG CCTAGCATGG CTCAGATGCA CGTTGTAGAT ACAAGAAGGA 4550
TCAAATGAAA CAGACTTCTG GTCTGTTACT ACAACCATAG TAATAAGCAC 4600
ACTAACTAAT AATTGCTAAT TATGTTTTCC ATCTCTAAGG TTCCCACATT 4650
TTTCTGTTTT CTTAAAGATC CCATTATCTG GTTGTAACTG AAGCTCAATG 4700
GAACATGAGC AATATTTCCC AGTCTTCTCT CCCATCCAAC AGTCCTGATG 4750
GATTAGCAGA ACAGGCAGAA AACACATTGT TACCCAGAAT TAAAAACTAA 4800
TATTTGCTCT CCATTCAATC CAAAATGGAC CTATTGAAAC TAAAATCTAA 4850
CCCAATCCCA TTAAATGATT TCTATGGCGT CAAAGCTCAA ACTTCTGAAG 4900
GGAACCTGTG GGTGGGTC&C AATTCAGGCT ATATATTCCC CAGGGCTCAG 4950
CGGATCCATG GGCTCCATCG GCGCAGCAAG CATGGAATTT TGTTTTGATG 5000
TATTCAAGGA GCTCAAAGTC CACCATGCCA ATGAGAACAT CTTCTACTGC 5050
CCCATTGCCA TCATGTCAGC TCTAGCCATG GTATACCTGG GTGCAAAAGA 5100
CAGCACCAGG ACACAGATAA ATAAGGTTGT TCGCTTTGAT AAACTTCCAG 5150
GATTCGGAGA CAGTATTCAA GCTCAGTGTG GCACATCTGT AAACGTTCAC 5200
TCTTCACTTA GAGACATCCT CAACCAAATC ACCAAAGCAA ATGATGTTTA 5250
TTCGTTCAGC CTTGCCAGTA GACTTTATGC TGAAGAGAGA TACCCAATCC 5300
TGCCAGAATA CTTGCAGTGT GTGAAGGAAC TGTATAGAGG AGGCTTGGAA 5350
CCTATCAACT TTCAAAGAGC TGCAGATCAA GCCAGAGAGC TCATCAATTC 5400
CTGGGTAGAA AGTCAGACAA ATGGAATTAT CAGAAATGTC CTTCAGCCAA 5450
GCTCCGTGGA TTCTCAAACT GCAATGGTTC TGGTTAATGC CATTGTCTTC 5500
AAAGGACTGT GGGAGAAAAC ATTTAAGGAT GAAGACACAC AAGCAATGCC 5550
TTTCAGAGTG AGTGAGCAAG AAAGCAAACC TGTGCAGATG ATGTACCAGA 5600
TTGGTTTATT TAGAGTGGCA TCAATGGCTT CTGAGAAAAT GAAGATCCTG 5650
GAGCTTCCAT TTGCCAGTGG GACAATGAGC ATGTTGGTGC TGTTGCCTGA 5700
TGAAGTCTCA GGCCTTGAGC AGCTTGAGAG TATAATCAAC TTTGAAAAAC 5750
TGACTGAATG GACCAGTTCT AATGTTATGG AAGAGAGGAA GATCAAAGTG 5800
TACTTACCTC GCATGAAGAT GGAGGAAAAA TACAACCTCA CATCTGTCTT 5850
AATGGCTATG GGCATTACTG ACGTGTTTAG CTCTTCAGCC AATCTGTCTG 5900
GCATCTCGTC AGCAGAGAGG CTGAAGATAT CTCAAGCTGT CCATGCAGCA 5950
CATGCAGAAA TCAATGAAGC AGGCAGAGAG GTGGTAGGGT CAGCAGAGGC 6000
TGGAGTGGAT GCTGCAAGCG TCTCTGAAGA ATTTAGGGCT GACCATCCAT 6050
TCCTCTTCTG TATCAAGGAC ATCGCAACCA ACGCCGTTCT CTTCTTTGGC 6100
AGATGTGTTT CCCCTCCGCG GCCAGCAGAT GACGCACCAG CAGATGACGC 6150
ACCAGCAGAT GACGCACCAG CAGATGACGC ACCAGCAGAT GACGCACCAG 6200
CAGATGACGC AACAACATGT ATCCTGAAAG GCTCTTGTGG CTGGATCGGC 6250
CTGCTGGATG ACGATGACAA ATTTGTGAAC CAACACCTGT GCGGCTCACA 6300
CCTGGTGGAA
CACCCAAGAC 
CTGGGCGGGG 
GTCCCTGCAG 
CCCTCTACCA 
ACTTCTGGCT 
TTGTGGATCT 


TAATAAAATG 

TOTGGGGGGT 

ATAOCAGGCA 

CTCTGTCTCT 

G G C C CGG T AC 

GGCCGTCQTT 

TTAATCGGCT 

GAGGCCCGCA 

ATGGAAATTG 

TTAAATCAGC 

ATAAATCAAA 

AACAAOAGTC 

&AC CGTCT AT 

GTGTATCCAC 

TCAGT-GCTCA 

TGATGATGTG 

AC ATG CG AAA 

CTATTTACCG 

GTTTTATTTG 

TATCAAGAGG 

TAACTAAGCA 

TGAAGGTCAT 

ATCCAAATCC 

AAACAGTAAT 

ATCCCTGTA& 

CCCTGTAATG 

AGGGAAAACT 

TACTATCTGC 

GTGGCTATTC 

AAGACCCGCT 

TTT CGGT AAA 

GCTGAAAAAT 

TATACCGAAT 

GGCGTAATCA 

CAATTCCACA 

GCCTAATGAG 

TTTCCAGTCG 

GCGCGGGGAG 

ACTGACTCGC 

CTCAAAGGCG 

AGAACATGTG 

GCGTTGCTGG 

AAA TGG ACQ C 

ACCAGGCGTT 

CTGCCGCTTA 

GCTTTCTCAT 

GCTCCAAGCT 

GCCTTATCCG 

ATCGCCACTG 

TAGGCGGTGC 



GCTCTCTAC( 
CCGCCGGGAG 
GCCCTGGTGC 
AAGCGTGOCA 
GCTGGAGAAC 
AATAAAAG&T 
GCTGTGCCTT 
TTCCTTGACC 
AGGAAATTGC 
GGGGTGGGGG 
TGCTGGGGAT 
CTCTCTCTCT 
CCAATTCGeC 
TT&CAACGTC 
TGCAGCACAT 
CCGATCGCCC 
TAAGCGTTAA 


AGAATAGAGC 
CACTATTAAA 
CAGGGCGATG 
C TT AA CTTAA 
GGGTCAACGA 
CTTAAAAAGT 
AACCTAAAAG 
CGGCTTTTTA 
AAGCTAAATC 
GTCATTATAT 
CTTGTCTCCT 
CGATAGCAGG 
AGCCATCCCA 
GGGCCAATAA 
AGCAGCTTGC 
CAGGTAAAGC 

AATAAATCCG 
TTCCTGCCAC 
AATGAAAAGC 
TAGCACCCAC 

TCAGCTTTTG 
TGGTCATAGC 
CAACATACGA 
TGAGCTAACT 
GGAA&CCTGT 
AGGCGGTTTG 
TGCGCTCGGT 
GT&ATACGGT 
AGCAAAAGGC 
CGTTTTTCCA 
T C AAGTC AG& 
TCCCCCTGGA 
C CGG AT AG CT 
AGCTCACGCT 
GGGCTGTGTG 
GTAAGTATGG 
GCAGCAGCCA 
TACAGAGTTC 



60 



AGAAGGACAG TATTTGGTAT 
AAAAAGAGTT GGTAGCTCTT 
GTGGTTTTTT TGTTTGCAAG 




TAGTGTGCGG 
GCAGAGGACC 
AGGCAGOCTG 
TTGTGGAACA 
TACTGCAACT 
CAGAGCTCTA 
CTAGTTGCCA 
CTGOAAGGTG 
ATCOCATTGT 
AGCACAGCAA 
GCGCcXGGGCT 
CTCTCTCTCT 
CTATAGTGAG 
GTGACTGGGA 
CCCCCTTTCG 
TTCCCAACAG 
TATTTTGTTA 
ACCAATAGGC 
GAGATAGGGT 
G AA C G TGG AC 
GCCCACTACT 
TGATTTTTAC 
G&ATTAACAT 
TACTCAATGG 
AGCTTGCCGA 
TTGAGCTTGA 
TTCTTTATCG 
TTCGCGGAAT 
GTTTACTCCC 
ATAATAATAC 
AATTGGTAGT 
CACCGGTTGC 
TGATGACTCT 
GATCCCACCA 
CAGAT&TAAA 
AGCAGTACTG 
AAAGG OTTGG 
CAACCATCAT 



CAAAATGGCA 
TTCCCTTTAG 
TGTTT CCTGT 
GCCGGAAGCA 
CACATTAATT 
CGTGCCAGCT 
CGTATTGGGC 
CGTTCGGCTG 
TATCCACAGA 
CAGCAAAAGG 
TAGGCTCCGC 
GGTGGCGAAA 
AGCTCCCTCG 
GTCCGGCTTT 
GTAGGTATCT 
CACGAAOCCC 
TCTTGAGTCC 
CTGGTAACAG 
TTGAAGTGGT 
CTGCGCTCTG 
GATCCGGCAA 
CAGCAGATTA 



GGAACGAGGC 

TGCAGGTGGG 

CAGCCCTTGG 

ATGCTGTACC 

AGGGCGCCTG 

GAGATCTGTG 

GCCATCTGTT 

CC&CTCCCAC 

CTGAGTAGGT 

GGGGGAGGAT 

CTATGGGTAC 

CG6TA.CCTCT 

TCGTATTACG 

AAACCCTGGC 

CCAGCTGGCG 

TTGCGCAGCC 

AAATTCGCGT 

CGAAATCGGC 

TGAGTGTTGT 

TCCAACGTCA 

C OG GG AX C AT 

CAAAATCATT 

TCCGTGAGGA 

CTGGTTATGC 

T AAAAAAGG C 

AAGATAAATA 

TAAAAAATGC 

A&CATCATTT 

CTGAGCTTGA 

AGTAAAACGC 

GAATGATTAT 

ATTGGTAAGG 

TTGTTTGGAT 

CCAGCCAATA 

GGCTAftAAAG 

CGGTTTTTTC 

AATACTGAGT 

GCTATTCCAT 

GAATTTGGCC 

TCGTTTTAAA 

TGAGGGTTAA 

GTGAAATTGT 

TAAAGTGTAA 

GCGTTGCGCT 

GCATTAATGA 

GCTCTTCCGC 

CGGCGAGCGG 

AT C AGGGG AT 

CCAGGAACCG 

CCCeCTGACG 

CCGGACAGGA 

CAGTTCGGTG 
CCGTTCAGCC 
AACCCGGTAA 
G ATT AG C AG A 
GGCCTAACTA 
CTGAAGCCAG 
ACAAACCACC 
CGCGCAGAAA 
TTCTTCTACA
GCAGGTGGAG 
CCGTGGAGGG 
AGCATCTGCT 
GATCGAGATC 
TGTTGGTTTT 
GTTTGCCCCT 
TGTCCTTTCC 
GTCATTCTAT 
TGGGAAGACA 
CTCTCTCTCT 
CTGGAGGGGG 
CGGGCTCACT 
GXTACCCAAC 
TAATAGCGAA 
TGAATGGCGA 
TAAATTTTTG 
AAAATCCCTX 
TGCAGTTTGG 
AAGGGCGAAA 
ATGACAAGAT 
AGGGGATTCA 
AAGCTTATGA 
ATATCGCAAT 
CAATTTATTG 
AAATAGATAG 
CCTCTTGGGT 
GGTGACGAAA 
GGGGTTAACA 

AAATAACAGC 
CTCACCAATA 
AG ACAT C ACT 
AAATT AAAAC 
GCAAATGCAC 
GCCCCATTTA 
GXAAAAGACC 
C C A^tAA CG A> . t 
TATCAATTGC 
TAAAGTGATG 
TTGCGCGCTT 
TATGCGCTCA 
AGCCTGGGGT 
CACTGCCC6C 
ATCGGCCAAC 
TTCCTCGCTC 
TATCAGCTCA 
AACGCAGGAA 
TAAAAAGGCC 
AGCATCACAA 
CTATAAAGAT 
TGTTCCGACC 
GAAGCGTGGC 
TAGGTCGTTC 
CGACCGCTGC 
GACACGACTT 
GCGAGGTATG 
CGGCTACACT 
TTACCTTCGG 
GCTGGTAGCG 
AAAAGGAT CT 



6350 
6400 
6450 
6500 

65 SO 
6600 

66 50 
6700 
6750 
6800 
S8SG 

6 900 
6350 
7000 
7030 
7100 
7150 
7200 
72S0 
7300 

73 50 

74 00 
74 50 
7500 
7650 
76 00 
7650 

7 7 00 
7750 
7800 
7S50 
7900 
7350 

eooo 

8050 
B100 
81S0 
8200 
6250 
83 00 
8350 
8400 
8450 
8500 

8 550 
8600 
$630 
8700 
8750 
6800 
6850 
SS>00 
8950 
900G 
9050 
9100 
9150 
9200 
9250 
S30C 
9350 



WO 2004/003157 



PCT/US2003/020389



CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC AGTGGAACGA 9400
AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA 9450
CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA 9500
TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA GTGAGGCACG 9550
TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC TGACTCCCCG 9600
TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG CCCCAGTGCT 9650
GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT TATCAGCAAT 9700
AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT 9750
CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT 9800
TCGCCAGTTA ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT 9850
GGTGTCACGC TCGTCGTTTG GTATGGCTTC ATTCAGCTCC GGTTCCCAAC 9900
GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA AGCGGTTAGC 9950
TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG CAGTGTTATC 10000
ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG 10050
TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA 10100
TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA 10150
TACCGCGCCA GATAGCAGAA CTTTAAAAGT GCTCATCATT GGAAAACGTT 10200
CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG ATCCAGTTCG 10250
ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT TTACTTTCAC 10300
CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG 10350
GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA 10400
TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT 10450
TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC ACATTTCCCC 10500
GAAAAGTGCC AC 10512

SEQ ID NO:32 (pTnMod (Oval/ENT tag/Proins/PA) - QUAIL)

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 50
CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100
TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCATCAGA TTGGCTATTG 150
GCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200
CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 250
TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 300
CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 350
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 400
GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 450
CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG 500
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 550
TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 600
CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 650
ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 700
TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 750
CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC 850
TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 900
GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GACGTAAGTA 950
CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000
CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTTCCTTAT GCTATAGGTG 1050
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100
CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC ATGGCTCTTT 1150
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 1200
TGACACGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 1250
AATTCACATA TACAACAAGG CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300
CATAGCGTGG GATCTCCACG CGAATCTCGG GTACGTGTTC CGGACATGGG 1350
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCCT GGTCCCATGC 1400
CTCCAGCGGC TCATGGTCGC TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 1450
CCAGACTTAG GCACAGCACA ATGCCCACCA CCACCAGTGT GCCGCACAAG 1500
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550
CACGGCTGAC GCAGATGGAA GACTTAAGGC AGCGGCAGAA GAAGATGCAG 1600
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650
GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC 1700
CGCGCGCGCC ACCAGACATA ATAGGTGACA GACTAACAGA CTGTTCCTTT 1750
CCATGGGTCT TTTCTGGAGT GACCGTCGGA CCATGTGTGA ACTTGATATT 1800
TTACATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT TAAAACGACT 1850
CAACAGCTTA ACGTTGGCTT GCCACGCATT ACTTGACTGT AAAACTCTCA 1900
CTCTTACCGA ACTTGGCCGT AACGTGCCAA CCAAAGCGAG AACAAAACAT 1950
AACATCAAAC GAATCGACCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2000
GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG GRAAAACGAC 2100
TTATCGTATT GCGAGCTTCA GTCGCACTAC ACGGTCGTTC TGTTACTCTT 2150
TATGAGAAAG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTGATGA 2200
CCAATTTCTA GCCGACCTTG GGAGCATTCT ACCGAGTAAC ACCACACCGC 2250
TCATTGTCAG TGATGCTGGC TTTAAAGTGC CATGGTATAA ATCCGTTGAG 2300
AAGCTGGGTT GGTACTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2350
AGACCTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CATGATATGT 2400
CATCTAGTCA CTCAAAGACT TTAGGCTATA AGAGGCTGAC TAAAAGCAAT 2450
CCAATCTCAT GCCAAATTCT ATTGTATAAA TCTCGCTCTA AAGGCCGAAA 2500
AAATCAGCGC TCGACACGGA CTCATTGTGA CCACCCGTCA CCTAAAATCT 2550
ACTCAGCGTC GGCAAAGGAG CCATGGGTTG TAGCAACTAA CTTACCTGTT 2600
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2650
GATTGAAGAA ACCTTCCGAG ACTCGAAAAG TCCTGCCTAC GGACTAGGCC 2700
TACGCCATAG CCGAACGAGC AGCTCAGAGC GTTTTGATAT CATGCTGGTA 2750
ATCGCCCTGA TGCTTCAACT AACATGTTGG CTTGCGGGCG TTCATGCTCA 2800
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATGGAA 2850
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2900
TACACAATAA CAAGGGAAGA CTTACTCGTG GCTGCAACCC TACTAGCTCA 2950
AAATTTATTC ACACATGGTT ACGCTTTGGG GAAATTATGA TAATGATCCA 3000
GATCACTTCT GGCTAATAAA AGATCAGAGC TCTAGAGATC TGTGTGTTGG 3050
TTTTTTGTGG ATCTGCTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 3100
CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT 3150
TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3200
CTATTCTGGG GGGTGGGGTG GGGCAGCACA GCAAGGGGGA GGATTGGGAA 3250
GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTCTATGG GTACCTCTCT 3300
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCGGTAC CTCTCTCTCT 3350
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 3400
TTGACCCGGT GACCAAAGGT GCCTTTTATC ATCACTTTAA AAATAAAAAA 3450
CAATTACTCA GTGCCTGTTA TAAGCAGCAA TTAATTATGA TTGATGCCTA 3500
CATCACAACA AAAACTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3600
CCTATCCAAG AAGTGATGCC TATCATTGGT TGGAATGAAC TTGAAAAAAA 3650
TTAGCCTTGA ATACATTACT GGTAAGGTAA ACGCCATTGT CAGCAAATTG 3700
ATCCAAGAGA ACCAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3750
GACCCTGAGC ACTGATGAAT CCCCTAATGA TTTTGGTAAA AATCATTAAG 3800
TTAAGGTGGA TACACATCTT GTCATATGAT CCCGGTAATG TGAGTTAGCT 3850
CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT 3900
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 3950
CATGATTACG CCAAGCGCGC AATTAACCGT CACTAAAGGG AACAAAAGCT 4000
GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGG 4050
AGGTCAGAAT GGTTTCTTTA CTGTTTGTCA ATTCTATTAT TTCAATACAG 4100
AACAAAAGCT TCTATAACTG AAJVPATATTT GCTATTGTAT ATTATGATTG 4150
TCCCTCGAAC CATGAACACT CCTCCAGCTG AATTTCACAA TTCCTCTGTC 4200
ATCTGCCAGG CTGGAAGATC ATGGAAGATC TCTGAGGAAC ATTGOWJTT 4250
CATACCATAA ACTCATTTGG AATTGAGTAT TATTTTGCTT TGAATGGAGC 4300
TATGTTTTGC AGTTCCCTCA GAAGAAAAGC TTGTTATAAA GCGTCTACAC 4350
CCATCVVAAG ATATATTTAA ATATTCCAAG TACAGAAAGA TTTTGTCTGC 4400
TCTTCACTCT GATCTCAGTT GGTTTCTTGA CGTACATGCT TCTTTATTTG 4450
CCTATTTTGT CAAGAAAATA ATAGGTCAAG TCCTGTTGTC ACTTATCTCC 4500
TGCCTAGCAT GGCTTAGATG CACGTTGTAC ATTCAAGAAG GATCAAATGA 4550
AACAGACTTC TGGTCTGTTA CAACAACCAT AGTAATAAAC AGACTAACTA 4600
ATAATTGCTA ATTATGTTTT CCATCTCTAA GGTTCCCACA TTTTTCTGTT 4650
TTAAGATCCC ATTATCTGGT TGTAACTGAA GCTCAATGGA ACATGAACAG 4700
TATTTCTCAG TCTTTTCTCC AGCAATCCTG ACGGATTAGA AGAACTGGCA 4750
GAAAACACtt TGTTACCCAG AATTAAAAAC TAATATTTGC TCTCCCTTCA 4800
ATCCAAAATG GACCTATTGA AACTAAAATC TGACCCAATC CCATTAAATT 4850
ATTTCTATGG CGTCAAAGST CAAACTTTTG AAGGGAACCT GTGGGTGGGT 4900
CCCAATTCAG GCTATATATT GCCCAGGGCT CAGCCAGTGG ATCCATGGGC 4950
TCCATCGGTG CAGCAAGCAT GGAATTTTGT TTTGATGTAT TCAAGGAGCT 5000
CAAAGTCCAC CATGCCAATG ACAACATGCT CTACTCCCCC TTTGCCATCT 5050
TGTCAACTCT GGCCATGGTC TTCCTAGGTG CAAAAGACAG CACCAGGACC 5100
CAGATAAATA AGGTTGTTCA CTTTGATAAA CTTCCAGGAT TCGGAGACAG 5150
TATTGX&GCT CAGTGTGGCA CATCTGTAAA TGTTCACTCT TCACTTAGAG 5200
ACATACTCAA CCAAATCACC AAACAAAATG ATGCTTATTC GTTCAGCCTT 5250
GCCAGTAGAC TTTATGCTCA AGAGACATAC ACAGTCGTGC CGGAATACTT 5300
GCAATGTGTG AAGGAACTGT ATAGAGGAGG CTTAGAATCC GTCAACTTTC 5350
AAACAGCTGC AGATCAAGCC AGAGGCCTCA TCAATGCCTG GGTAGAAAGT 5400
CAGACAAACG GAATTATCAG AAACATCCTT CAGCCAAGCT GCGTGGATTC 5450
TCAAACTGCA ATGGTCCTGG TTAATGCCAT TGCCTTCAAG GGACTGTGGG 5500
AGAAAGCATT TAAGGCTGAA GACACGCAAA CAATACCTTT CAGAGTGACT 5550
GAGCAAGAAA GGAAACCTGT GCAGATGATG TACCAGATTG GTTCATTTAA 5600
AGTGGCATCA ATGGCTTCTG AGAAAATGAA GATCCTGGAG CTTCCATTTG 5650
CCAGTGGAAC AATGAGCATG TTGGTGCTGT TGCCTGATGA TGTCTCAGGC 5700
CTTOAC3CAGC TTGAGAGTAT AATCA6CTTT GAAAAACTGA CTGAATGGAC 5750
CAGTTCTAGT ATTATGGAAG AGAGGAAGGT CAAAGTGTAC CTACCTCGCA 5800
TGAAGATGGA GGAGAAATAC AACGTCACAT CTCTCTTAAT GGCTATGGGA 5850
ATTACTGACG TGTTCAGCTC TTCAGCCAAT CTGTCTGGCA TCTCCTCAGT 5900
AGGGAGCCTG AAGATATCTC AAGCTGTCCA TGCAGCACAT GCAGAAATCA 5950
ATGAAGCGGG CAGAGATGTG GTAGGCTCAG CAGAGGCTGG AGTGGATGCT 6000
ACTGAAGAAT TTAGGGCTGA CCATCCATTC CTCTTCTGTG TCAAGCACAT 6050
CGAAACCAAC GCCATTCTCC TCTTTGGCAG ATGTGTTTCT CCGCGGCCAG 6100
CAGATGACGC ACCAGCAGAT GACGCACCAG CAGATGACGC ACCAGGAGAT 6150
GACGCACCAG CAGATGACGC ACCAGCAGAT GACGCAACAA CATGTATCCT 6200
GAAAGGCTCT TGTGGCTGGA TCGGCCTGCT GGATGACGAT GACAAATTTG 6250
TGAACCAACA CCTGTGCGGC TCACACCTGG TGGAAGCTCT CTACCTAGTG 6300
TGCGGGGAAC GAGGCTTCTT CTACACACCC AAGACCCGCC GGGAGGCAGA 6350
GGACCTGCAG GTGGGGCAGG TGGAGCTGGG CGGGGGCCCT GGTGCAGGCA 6400
GCCTGCAGCC CTTGGCCCTG GAGGGGTCCC TGCAGAAGCG TGGCATTGTG 6450
GAACAATGCT GTAGCAGCAT CTGCTCCCTC TACCAGCTGG AGAACTACTG 6500
CAACTAGGGC GCCTGGATCC AGATCACTTC TGGCTAATAA AAGATCAGAG 6550
CTCTAGAGAT CTGTGTGTTG GTTTTTTGTG GATCTGCTGT GCCTTCTAGT 6600
TGCCAGCCAT CTGTTGTTTG CCCCTCCCCC GTGCCTTCCT TGACCCTGGA 6650
AGGTGCCACT CCCACTGTCC TTTCCTAATA AAATGAGGAA ATTGCATCGC 6700
ATTGTCTGAG TAGGTGTCAT TCTATTCTGG GGGGTGGGGT GGGGCAGCAC 6750
AGCAAGGGGG AGGATTGGGA AGACAATAGC AGGCATGCTG GGGATGCGGT 6800
GGGCTCTATG GGTACCTCTC TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC 6850
TCTCTCGGTA CCTCTCTCGA GGGGGGGCCC GGTACCCAAT TCGCCCTATA 6900
GTGAGTCGTA TTACGCGCGC TCACTGGCCG TCGTTTTACA ACGTCGTGAC 6950
TGGGAAAACC CTGGCGTTAC CCAACTTAAT CGCCTTGCAG CACATCCCCC 7000
TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT CGCCCTTCCC 7050
AACAGTTGCG CAGCCTGAAT GGCGAATGGA AATTGTAAGC GTTAATATTT 7100
TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA 7150
TAGGCCGAAA TCGGCAAAAT CCCTTATAAA TCAAAAGAAT AGACCGAGAT 7200
AGGGTTGAGT GTTGTTCCAG TTTGGAACAA GAGTCCACTA TTAAAGAACG 7250
TGGACTCCAA CGTGAAAGGG CGAAAAACCG TCTATCAGGG CGATGGCCCA 7300
CTACTCCGGG ATCATATGAC AAGATGTGTA TCCACCTTAA CTTAATGATT 7350
TTTACCAAAA TCATTAGGGG ATTCATCAGT GCTCAGGGTC AACGAGAATT 7400
AACATTCCGT CAGGAAAGCT TATGATGATG ATGTGCTTAA AAACTTACTC 7450
AATGGCTGGT TATGCATATC GCAATACATG CGAAAAACCT AAAAGAGCTT 7500
GCCGATAAAA AAGGCCAATT TATTGCTATT TACCGCGGCT TTTTATTGAG 7550
CTTGAAAGAT AAATAAAATA GATAGGTTTT ATTTGAAGCT AAATCTTCTT 7600
TATCGTAAAA AATGCCCTCT TGGGTTATCA AGAGGGTCAT TATATTTCGC 7650
GGAATAACAT CATTTGGTGA CGAAATAACT AAGCACTTGT CTCCTGTTTA 7700
CTCCCCTGAG CTTGAGGGGT TAACATGAAG GTCATCGATA GCAGGATAAT 7750
AATACAGTAA AACGCTAAAC CAATAATCCA AATCCAGCCA TCCCAAATTG 7800
GTAGTGAATG ATTATAAATA ACAGCAAACA GTAATGGGCC AATAACACCG 7850
GTTGCATTGG TAAGGCTCAC CAATAATCCC TGTAAAGCAC CTTGCTGATG 7900
ACTCTTTGTT TGGATAGACA TCACTCCCTG TAATGCAGGT AAAGCGATCC 7950
CACCACCAGC CAATAAAATT AAAACAGGGA AAACTAACCA ACCTTCAGAT 8000
ATAAACGCTA AAAAGGCAAA TGCACTAGTA TCTGCAATAA ATCCGAGCAG 8050
TACTGCCGTT TTTTCGCCCC ATTTAGTGGC TATTCTTCCT GCGACAAAGG 8100
CTTGGAATAC TGAGTGTAAA AGACCAAGAC CCGCTAATGA AAAGCCAACC 8150
ATCATGCTAT TCCATCCAAA ACC4ATTTTCG GTAAATAGCA CCCACACCGT 8200
TGCGGGAATT TGGCCTATGA ATTGCGCTGA AAAATAAATA ATCAACAAAA 8250
TGGCATCGTT TTAAATAAAG TGATGTATAC CGAATTCAGC TTTTGTTCCC 8300
TTTAGTGAGG GTTAATTGCG CGCTTGGCGT AATCATGGTC ATAGCTGTTT 8350
CCTGTGTGAA ATTGTTATCC GCTCACAATT CCACACAACA TACGAGCCGG 8400
AAGCATAAAG TGTAAAGCCT GGGGTGGCTA ATGAGTGAGC TAACTCACAT 8450
TAATTGCGTT GCGCTCACTG CCCGCTTTCC AGTCGGGAAA CCTGTCGTGC 8500
CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG GTTTGCGTAT 8550
TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC TCGGTCGTTC 8600
GGCTGCGGCG AGCGGTATCA GCTCACTCAA AGGCGGTAAT ACGGTTATCC 8650
ACAGAATCAG GGGATAACGC AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA 8700
AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT TTCCATAGGC 8750
TCCGCCCCCC TGACGAGCAT CACAAAAATC GAGGCTCAAG TCAGAGGTGG 8800
CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC CTGGAAGCTC 8850
CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG 8900
CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCATAGCTC ACGCTGTAGG 8950
TATCTCAGTT CGGTGTAGGT CGTTGGCTCC AAGCTGGGCT GTGTGCACGA 9000
ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG 9050
AGTCCAACCC GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT 9100
AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG AGTTCTTGAA 9150
GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT GGTATCTGCG 9200
GTCTGCTGAA GCCAGTTACC TTGGGAAAAA GAGTTGGTAG CTCTTGATCC 9250
GGCAAACAAA CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA 9300
GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA 9350
CGGGGTCTGA CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC 9400
ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA ATTAAAAATG 9450
AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT 9500
ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG TCTATTTCGT 9550
TCATCCATAG TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA 9600
GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA GACCCACGCT 9650
GACCGGCTCC AGATTTATCA GCAATAAACC AGCCAGCCGG AAGGGCCGAG 9700
CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT CTATTAATTG 9750
TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG 9800
TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTTGGTATG 9850
GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC 9900
CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA 9950
GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT 10000
AATTGTCTTA CTGTCATGCC ATCCGTAAGA TSCTTWCTS TGACTGGTGA 10050
GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT 10100
CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA 10150
AAAGTGCTCA TCATTGGAAA ACGTTCTTGG GGGCGAAAAC TCTCAAGGAT 10200
CTTACCGCTG TTGAGATCGA GTTCGATGTA ACCCACTCGT GCACCCAACT 10250
GATCTTCAGC ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA 10300
GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGGGACAC GGAAATGTTG 10350
AATACTCATA CTCTTGCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT 10400
ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA 10450
ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCAC 10487



SEQ ID NO:33 (conalbumin polyA)
tctgccattg cfcgct- C.eccc fcgcccttcct cgtcactctg satgtggcbt cttcgctact
gcc&cagc&s gaaataaaat ctcaaca.tbt aaatgggttt cctgaggttt ttcaagagtc
gttaagcacs fctcctxcccc agcacccctt gctgcaggcc agtgccaggc accaacttgg
etactgctgc ecatgagaga aacccagttc aatactttee asagcaaaat ggatfcacata
tgccctagat cctgafctaac aggcgtttgc attatctagt gctttcgctt cacccagatc 
atcccatCQC efcccc 

SEQ ID NO:34 (exemplary antibody light chain sequence)

1 gagctcgtga tgacccagac tccatcctcc ctgtctgcct ctctgggaga cagagtcacc
61 atcagttgca gggcaaatca ggacattagc aattatttaa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctactac acatcaagat cacaatcagg ggtcccatca
181 aggttcagtg gcagtgggtc tggaacagat tactctctca ccatcagcaa cctggagcaa
241 gaagattttg ccacttactt ctgccaacag ggtaatacgc ttccgtggac gttcggtgga
301 ggcaccaacc tggaaatcaa acgggctgat gctgcaccaa ctgtatccat ctccccacca
361 tccagtgagc agttaacatc tggaggtgcc tcagtcgtgt gcttcctgaa caacttccac
421 cccaaagaca tcaatgtcaa gtggaagatt gatggcagtg aacgacaaaa tggcgtcctg
481 aacagttgga ctgatcagga cagcaaagac agcacctaca gcatgagcag caccctcacg
541 ttgaccaagg acgagtatga acgacataac agctatacct gtgaggccac tcacaagaca
601 tcaacttcac ccattgtcaa gagctccaac aggaatgagt gttaa



SEQ ID NO:35 (exemplary antibody heavy chain sequence)

1 cc.cgagtcag gaectggccc ggtggcgccc ccacagaacc tgtccatcac tr.gcacc.gtc 
61 tctgggtttx eactaaccag ctatggtgta cactgggttc gccagcctec aggaaagggt 
121 ctggaatggc tgggagcaat axggactggt agaagcacaa cttataactc ggctctcatg 
1B1 frccagactga gcafcc&gcaa agacaactcc a&gagccaag ttfctcttaaa aacgaacagt 

25 241 ctgcaaactff atgacacagc catttactac tgtggcagag ggggtctgat tacgtccttt 
3 01 gefcatggact actggggtca aggaacctca gtcaccgtct cctcagccaa aacgaeaccc 
361 ccafcctgtec acccaetggc ccetcggatct gctgcccaaa ctaaetccac ggtgaccctg 
421 ggafcgcctgg tcaagggcta fcfc'cceetgag ceagfcgacag tgacctggaa ctctggaccc 
481 ecgcccagcg gtgcgcacae cttcccagct gtcctgcagt ctg&cctcta eactctgagc 

30 541 agcLcagtga ctgtccccfcc cagcacctgg cccagcgaga ccgtcacctg caacgttgcc 
SOI cacccggcca gcagcaccaa ggtggacaag aaaafctgcgc ccagggattg taetagt 

SEQ ID NO:36 (pTnMCS)

35 1 ctgacgcgcc ctgtagcggc gcatfcaageg cggcgggtgt ggtggfccacg cgcagcgtga 
61 ccgctacact tgccagcgcc ctagcgcccg ctcctfctcgs ttcotfcccct ccctttcteg 

121 cc;acgfctcgc cggcatcaga ttggctattg gccautgc&fc acgtcgtatc ca'cateauaa 

181 tatgtacatt tataccggcc catgtccaae atcaccgcea cgtcg&c&tt: gatvaccgae 

241 tagttattaa tagtaatcaa ttaeggggtc astagtvcat. agcceatata ~.ggagfctccg 

40 3 01 cgfccacat'aa cttacggtaa atggcccgce 'cggctgaccg cccaacgact cccgcccatx 

3 61 gacgccaata atgacgtatg ttcccatagt aaegceaata gggaetttcc at tgacgtca 

421 atgggcggag fcattcaeggc a&actgccca cttggcagta eatcaagtgt atcatatgcc 

$81 aagfcacgecc cct&ctgacg ccaafcgaegg taaafcggccc gcccggcatt atgeecagta 

531 csu.gacccr.a cgggacr.tr.c ctacctggca gcacatctac gfcatcagtca tcgctattac: 

45 S01 Cwtggtgatg cggtcttggc agtacaccaa tgg9cgtgga tagcggtttg actcacgggg 

661 atttcca&gt ctccacccca ttgacgtcaa tgggagtt^g tttcggcaec aaaatcaacg 

721 ggactttcca aaatgtcgta asaactccgc cccattgacg caaacgg«c*g gtaggcg^gt 

7 SI acggtgggag gtctatataa gcagagctcg -cfcagtgaac cgtcagatcg cctggagacg 
841 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg
901 ggaacggtgc attggaacgc ggattccccg tgecaagagt gaegtaagta ccgcctatag 
9£1 acfcctatagg ca ca c c c c 1 1 tggc t c t t a t gcatgctata ctgtttttgg cttggggcct 
1021 atacaccecc gcctccktafc gctafcaggfcg atggtatagc tcagccfcata ggtgfcgggtt 
1081 atfcgaccafcfc s t tg«cc*ct iccc Cat tg^ tgacga cact ct.ccat.tact aatccata&e 
1141 atggctcttt gccacaacta tGtcta?:tgg ctatatgcca atactctgtc cttcagagac 
1201 tgacaeggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 
i;acaacascg ccgtcccccg tgcccgcagt: ttt.tatcaaa catagcgtgg gatctcca^g 
cgaatctcgg gtacgt.gctc cggacacggg ctcctctccg gtagcggcgg agct Lecaea 
tccgagcccfc ggtcccatgc ctceagcggc fcc&fcggfccgc- ccggcagctc cttgctccfca 
1441 ctcagtggagg ccagacttag gcar^agcaca atgcccacca ccaccagtgt gccgcacaag 
1501 c]cogfcggcgg tagggtatgt gtctgaaaafc gagcgtggag attgggctcg cacggctgac 
1S61 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtatfcc 
1621 tgafcaagagt cagaggtaac tcccgtcgcg gt.gcfc.gfc fcaa cggtggaggg cagfcgtagtc 
1681 tgagcagtac tcgttgctgc cgcgcgcgcc aceagacata atagctgaca gactaacaga 
1741 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgcga actcgatatt 
1801 tcacacgact ctcf.ttacca attctgcccc gaavtacact taaaacgacfc caacagcfcfca 
1SS1 acgfctggctt gccacgeact aectgactgt a&aactctea ctcttaccga acttggccgt 
1921 aacetgccaa ceaaagcgag aaeaaaacat aacafceaaac gaatcgaccg attgttaggt 
1981 aatcgtcacc tccaeaaaga gegactcget gtataccgtt ggcatgctag ctttatctgt 
tegggcaata egatgcccat tgcaettgtt gactggtctg atatSpcgtga gcaaaaacga 
ctfcatggtat tgcgagcttc agtcgcacta cacggtcgtt ctgttactct ttatgagaaa 
gegttcccgc tttcagagca atgt^caaag aaagctcatg accaattcct agccgacctt 
gcgageattc taccgagcaa eaccacaceg ctcattgtca gtgacgctgg cfcttaaagtg 
ccatggtata a a t c c g t. tg a g a age fc ggg t t. gg fc a c £ g g t Caagtcgagfc aagaggaaas 
gfcacaatatg cagacctagg ageggaaaac tggaaaccta fceagcaaett acatgatatg 
tcatetagtc actcaaagae tttaggctac : aagaggctga etaaaagcaa tccaacctca 
tgecaaatte tattgtafcaa afcetcgctefc aaaggccgaa aaaatcagcg etegacaegg 
actcattgtc accaeccgtc acctaaaatc tactcagegt eggcaaagga gccatgggtt 
ctagcaacta acttacctgt tgaaattega acacccaaac aactfcgttaa tatctaefceg 
aagcgaatgc agattgaaga aaccttcega gacttgaasa gtcctgccca cggact.aggc 
2701 cfcacgccata gecgaacgag :cagctcagag cgfcttfcgata teafcgetget aafcegccetg 
2761 y.tgefctcaae; fcaacatgfctg gettgeggge gfctcatgetc agaaacaagg fcfcggg&caag 
uacttccagg ctaacacagt cagaaatcga aacgcacfcct caacagt teg cttaggcatg 
gaagttttgc ggcattctgg ctacacaata acaagggaag act tact cgt ggctgcaacc 
ctaccagctc aaaatttatt cacacatggt tacgcctcgg ggariatcacg aggggatege 
tetagagega teegggatet egggaaaage gccggr.gacc aaaggv.gcct tvtatcatca 
cfct t aaaaa t aaaaaacaa t tactcagtgc ctgttataag : cagcaat taa 1 1; a tgat t ga 
cgcctacatc acaacaaaaa ctgatttaac aaatggttgg f.ct gectvag aaagtatatt
tgaacattat cttgatcata ttac.f.gaf.aa taataaaaac cCtatcccta tccaagaagt 
gatgectate attggttgga aCgaacttga aaaaaattag ccttgaatac attactggta 
agg'caaaege cattgtcagc aaatfcgatcc aagagaacca acttaaagct ttcctgaegg 
aatgttaatt ctcgttgacc ctgagcactg atgaatcccc taac.gat.ttt ggtaaaaatc 
attaagttaa ggtggataca eat.cCtgtca tatgatcccg gtaatgtgag ttagetcact 
catt.aggcac cccaggcttt. acactttatg cttccggctc gtatgttgtg tggaattgtg 
^g^ggsc^ac aatttcacac aggaaacagc catgaccatg attacgecaa gegegcaatt 
aaccctcact aaagggaaca aaagctggag ctccaccgcg gtggcggccg ctccagaact 
agtggatccc ccgggccgca ggaattcgat atcaagctta tcgataccgo tgaectcgag 
?jgggggcccg gtacccaatt cgccctatag tgagtegtat tacgcgcgct cactggccgt 
cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc 
t-catcccccr. ttcgecagot ggogtaa tag egaagaggee cgcaccgatc gcccttccca 
i?,cagt:tgcgc agectgaatg gcgaatggaa afctgtaagcg ttaatatttt gttaaaattc 
gegtcaaatt tttgtraaat cagctcattt ttfcaaccaat aggecgaaat eggcaaaate 
ccttataaat caaaagaata gaccgagata gggttgagtg ctgttccagt teggaacaag 
4 051 agtccactat taaagaacgt ggactccaac gtcaaagggc gaaaaaccgt ctatcagggc 
4141 fiatggcccac tactceggga r.catar.gaca agatgtgtat ccaccttaac ttaatgattt 
ttaccaaaat cactagggga fetcatcagfcg ctcagggtca acgagaatta acattccgtc 
aggaaagctt atgatgatga tgtgcttaaa aacttactca atggctggtt atgeacateg 
castacatgc gaaaaaccta aaagagct?:g ccgataaaaa aggecaattt afctgctattt 
aecgeggctt tttattgagc ttgaaagata aataaaatag ataggtttta tttgaagcta 
aatcttcttt ategtaaaaa atgccctctt gggttatcaa gagggtcatt atatttcgeg 

gaataacatc at£Cggtgac gaaataacta agcacttgtc tcctgtttac vcccctgagc
tcgaggggtt aacatgaagg tcatcgatag caggataata atacagt«<aa acgctaaacc
aataatccaa atccagccat cccaaa ttgg tagtgaatga ttataaataa oagcaaacag
caatgggeca ataacacegg "ctgcattggt aaggctcacc aataatcccfc gtaaagcacc
ttgctgatga ctctetgttt ggatagacat cactccct.gt. aatgcaggta aagcgafccc
4801 accaccagcc a at a a a a 1 1 a a a a c aggg a a aaetaaceaa ccttcagata taaacgctaa
4861 aaaggcaaat gcactactat ctgcaataaa tccgagcagt actgccgttt cttcgcccat
4921 cfc&gfcggcta ttcttectgc cacaaaggct tggaataetg agtgtaa&ag accaagaccc
4981 gtaatgaaaa geeaaccatc atgetattca tcateacgat tfcetgtaata gcsaccacacc
5041 <jtgctggafct ggctatcaat gcgctgaaafc aataataaac aaatggcate gttaaataag
5101 -gatgtatae egateagett ttgttccctt tagtgagggt fcaatfcgegcg cfctggegtaa
5161 ^.eatggteat agctgttLcc tgtgtgaaac fcgttacecge tcacaattcc acacsacata
5221 cgagccggaa geataaagtg taaagcetgg ggtgccfcaat gagtgagcta actcacstta
5281 atcgcgttgc gctcaetgcc cgctfctceag tcgggaaacc fcgtcgfcgcea gctgcatcaa
5341 tigaatcggcc aacgcgcggg gagaggeggt ttgcgtactg ggcgctcttc cgettcctcg
5401 cfceactgact cgctgcgctc ggfccgfctegg ctgcggcgag cggtatcagc tcactcaaag
5461 gcggta&tac ggttatccae agaatcaggg gataacgeag gaaagaacat gtgagcaaaa
5521 ggccagcaaa aggccaggaa cegtaaaaag gccgcgttgc tggcgttttt ccataggctc
5581 cgeeeecetg aegageatca caaaaatega cgctcaagtc agaggtggcg aaacccgaca
5641 ggactat&aa gataecaggc gttfcocccct ggaagctccc tcgtgcgcte tcctgefcceg
5701 accctgccgc ttaccgg&ta cctgtccgcc tttctccctfc cgggaagcgt ggcgefcttct
5761 catagctcac gctgtaggta cctcagttcg gtgtaggtcg tt.cgctccaa gctgggctgt
5821 gtgcacgaac cccccgttca gccegaccgc fcgcgecttafc ccggtaacta tcgtcttgag
5881 tccaacccgg taagacacga cttatcgcca etggcageag ccacfcggtaa caggafctagc
5941 agagcgaggc acgtaggcgg tgctac&gag fctcttgaagt ggtggccfcaa ctacggctac
6001 actagaagga cagtatttgg tatctgcgct ctgctgaage cagttaccfct cggaaaaaga
6061 gttggtagct cttgatccgg caaacaaacc aecgctggta gcggtggfcfcfc tfctugtfcfcge
6121 uaccagcaga ttacgcgcag aaaaaaagga tctca&gaag atcctfctgat ctttcetacg
6181 gggtctgacg ctcagtggaa cgaaaactca cgctaaggga ?:cttggtcat gagattatca
6241 ^aaaggatet tcacct'agat cettttaaat: fcaaaaatgaa gttttaaatc aafccfcaaagt
6301 atatatgagt aaacttggtc tgacagttac caatgcttaa fceagfcgaggc acctatccca
6361 gcgatctgtc tatfctcgctc atecatagtt gcctgactcc ccgscgtgfca gataactacg
6421 f.tacgggagg gcttaccatc tggcccc&gt gcfcgeaatga tacegcgaga cccacgctca
6481 ccggctccag at & tat cage aataaaecag ccagceggaa gggcegagcg cagaagtggt
6541 cccgcaaefct tatccgcccc carccagtct attaattgtt geegggaage tagagtaagt
6601 agtfccgecag ttaatagttt gcgeaacgtt. gttgccattg ctacaggoat cgtggtgtca
6661 egctcgfcegfc ttggtatggc ttcatfccagc tecggtfcccc aacgatcaag gcgagttaca
6721 tgatcececa tgctgtgcaa aaaageggtt ageteefcteg gtcctccgafc cgttgtcaga
6781 agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcacaa tccccteace
6841 gtcatgecat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcaetctga
6901 gaatagtgta tgcggcgacc gagttgetet tgcccggcgt caauaeggga taataccgcg
6961 ccacatagca gaactctaaa agtgctcatc attggaaaac gtccttcggg gegaaaaetc
7021 tcaaggatct taccgctgtt gagafcecagt tcgatgfcaac ccactcgtgc acccaactga
7081 tct:tcagcat cttttacttt caccagcgtt teegggtgag caaaaacagg aaggcaaaaJ;
7141 geegcaaaaa agggaataag ggegacaegg aastgttgaa t.act:catact cttccttttt
7201 caatattatt gaagcatcta tcagggttat cgscccatga geggatacat atttgaatgt
7261 attr.agaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gecac



SEQ ID NO: 37 (chicken ovalbumin enhancer)
cegggbtgea gaaaaatgee aggtggacca cgaactcaca cccaaaggag 
ettga'petga tacctgattt tcttcaaact ggggaaacaa cacaatccca caaaacagct 
cagagagaaa ocatcactga tggctacagc accaaggtat gcaatggcaa tccattcgac
attcatccgt gacctgsgca aaatgattta tctctccatg aatggttgct tctttccctc 
atgaaaaggc aatttccaca ctcacaatat gcaacaaaga caaacagaga acaattaatg 
tgctcrt^cc taatgtcaaa attgtagtgg caaagaggag aacaaaacct caagttcLga 
gtaggtttta gtgattggat aagaggcttt gaectgtgag ctcacctgga cttcatatcc 
ttttggataa aaagtgcttt tataactttc aggtctccga gtcttcattc atgagactgt
tggtttaggg acagacccac aatgaaatgc ctggcatagg aaagggcagc agagecttag 
ctgacctt&t cfctgggacaa gcattgtcaa acaatgtgtg acaaaactat ttgtactgcr. 
ttgeacaget gtgctgggca gggcaatcca ttgccaccta tcccaggtaa ccttccaact 
gcaagaagat tgt: t:gct: txac fcetefcefcaga 


SEQ ID NO: 38 (5' untranslated region)

GTGGATCAACATACAGCTAGAAAGCTGTATTGCCTTTAGCACTCAAGCTCAAAAGACAACTCAGAGTTCACC

SEQ ID NO: 39 (putative cap site)

ACATACAGCTAG AAAGCTGTAT TGCCTTTAGC ACTCAAGCTC AAAAGACAAC TCAGAGTTCA

SEQ ID NO: 40 (fragment of ovalbumin promoter - chicken)
GAGGTCAGAAT GGTTTCTTTA CTGTTTGTCA ATTCTATTAT TTCAATACAG 
AACAATAGCT TCTATAACTG AAATATATTT GCTATTGTAT ATTATGATTG
GAGG
AGT'rCATATC 
GAG CTATGTT 
ACACCCATAA 
CGTCTGCTCT 
TTATTTCTCC 
TTATGTCCTG 
TCAAATGAAA 
ACTAACTAAT 
TTTCTGTTTT 
GA&CATGAGC 

TATTTGCTCT 
CCCAATCCCA 
GGAACCTGTG 



CATGAACACT 
CCATTAAGTT 
ATAAACACAT 
TTOCTGTATv 
AAAGATAGAT 
TCACTCTAGT 
TATTTTGTCA 
CCTAGCATGG 
CAGACTTCTG 
AATTGCTAAT 
CTTAAAGATC 
AAa'ATTTGCC 
ACAGGCAGAA 
CCATTCAATC 
TTAAATGATT 
GGTGGGTCAC 



CCTCCAGCTG AATTTCACAA 



ATTCATGGAA GATCTTTGAG 

GTATTGTTTT 
AAGTTTGTTA 
CAGCTATAGG 



CAGAAAAA 
■1TAAATATTC- 
CTCAGTTGGC 
AGAAAATAAT 
CTGAGATGGA 
GTCTGTTACT 
TATGTTTTCC 
CCATTATCTG 



AGTCTTCTCT 



TC CTT C AC AT 
AGGTCACGTC 
CGTTG TAG AT 
AC A AC CAT AG 
ATCTCTAAGG 
GTTGT AACTG 
CCCATCCAAC 



AACACATTG? TACCCAGAAT 
CAAAATGGAC CTATTGAAAC 
TCTATGGCGT CAAAGGTCAA 



■AATTCAGGCT 



ATATATTCCC 



TT C CTCTGTC 
GAAC&CTGCA 
SCAT TGTATG 
TAAAGCATTC 
AAAGAAAGTG 
GCATGCTTCT 
XTGTTCTCAC 
ACAAGAAGGA 
TAATAAGCAC 
TTCCCACATT 
AAGCTCAATG 
AGTGCTGATG 
TAAAAACTAA 
TAAAATCTAA 
ACTTCTGAAG 
CAGGGCTCAG 
SEQ ID NO: 41 (pTnMCS (CMV-CHOVg-ent-ProInsulin-synPA))

1 ctga.cgcgcc cegfcagcgge gcafcc.aageg cggcgggtgfc ggtggttacg cgcagcgfcga 
61 ccgetacact fcgccagcgce cfcagcgrcccg . ctecv.tccgc fctfcettccet tcctttcfccg 
121 ecacgtt cgc cggcateaga 'o gge v.a & t.g gcca v. * g ca fc acgttgtatc eatat&ataa 
181 fcatgtacatt tatattggcfc catgte'caas atfcaccgcca tgtfcgaeatt gafctastgac 
241 tagttattaa tagfcaatcaa ttacggg^tc afctagttcat agcccafcata tggagtfcccg 
3d cgtfcacstas cfcti&cggfc&a atggcccgccc tggqtgaccg cccaacgacc cccgcccatt 
361 gacgtcaata atgacgtatg tv.cccacagt: aacgecaava gggactxntcc attgaegtca 
421 afcgggtggag tatttacggt aaacfcgccca cfctggcagta catcaagtgt afccacatgcc 
4 81 aagtacgccc ectiattgacg tqaatgacgg fcaaatggccc gcctggcstt atgcccagta 
5.41 catgacctta tgggaetfctc efca c fctggca g taca *c c t ac gtattagtea t ogct attac 
601 catggtgatg cg-gfctttggc agtacatcaa tgggcgtgga tagcggtfcg aoceacgggg 

atttccaagt ctccacccca tfegacgfccaa taggagttcg £tttggcacc aaaatcaacg 
ggactttcca aaafcgtcgta acaactccgc cceattgacg caaatgggcg gtaggegtgt 
a";ggtgggag gtcfcatataa ge&g&gcfccg tttagtgaac cgtc&gatcg cctggagacg 
cs'catccacgc tgttttgacc teqafcagaag acaccgggac cgatccsgcc tccgcggccg 
goaacggtgc attggaacgc ggatcccccrg r.gccaagagt gacgtaagt-a ccgcct:at«g 
a^jtctatagg cacacccct t tggc tct t -it gcatgcfcata ctgtttttgg cf.tggggcct 
1021 atacaccccc gcttccttaC gctatacgtg af.ggtatagc tt^gcctata ggtgtgggtt 
1081 attgsccatt ai't.gaccacfc cccct'^t r;.gg tgacganacr; ttccattact astvccata^c 
atggctcttt gccacaacta tctctattg^ ctatafcgcca atactct^tc cttca.gagac 
t';gacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 
tacaacaacg ccgt-cccccg f.gcccgcsgt fetttattaaa catagcgtgg gatctccacg 
cgaatctcgg gtacgtgctc cggacatggg ctcttcccc« gtagcggcgg agcttccaca 
tccgagccct ggtcccatgc ctccagcggc tcratggtcgc tcggcagctc cttgctccta 
«cagtggagg ccagacti'ag gcacagcaca atgcccacca ccaccagcgt gccgcacaag 
csccqfcqotcorg taqgatatgt qtctqaaaat gaqcqtqqag attgcjgctcg cacggctgac 
gcagatggaa gactf.aaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 
cgavaagagt. csgaggtaac tcccgttgcg: gtgctgttaa cggtggaggg oagtgtagtc 
tgagcagcac tcg ttgctgc cgegcgcgcc acc<agacata a t -age cgaca gsctaacaga 
ctgttccttt ocatgggvee 1 1. ^ct.gcagt oaccgccgga ccatgtgcga actcgatatt 
ttacacgact ccccttacca attctgcccc gaattacaet taaaacgact caacagctta 
aegftgg c 1 1 gccacgca t t act tga ctgt aaa ac tct ca ct c t caccga ac c tggccgt 
aaocCgccaa ecaaagegag aacaaaacat aacatcaaac gaatcgaccg atfcgttaggt. 
aatcgtcacc tccacaaaga gcgactcgct gfcataccgtt ggcatgetag cttfcatctgt 
t.zqiszc&z.tB cgatrgcccat tgtacttgtt gactggtctg atattcgtga gcaaaaacga 
c;ttatggtat tgccagcttc ag'ccgcacfca caeggtegtt ctgttactct tfcafcgagaaa 
c;cgctcccgc f. r. c c aga g c a a t g c t c aa a g a aa g c t c a v 9 accaatttct agccgaccct 
cscgagcattc taccgagtaa cacc;aca?!cg ctcavtgtcrja gv.gar,gccgg t"Jtt;t;aaagtg 
<:catggtafca aat'.cicgtvga gaagctgggt tggtactggf, taagtcgagt aagaggaaaa 
2341 gtacaatatg cagac^tagg ageggaaaac tggaaaccta tcagcaactt acatgafcatg 
2401 f cau.c?jagr;.o aeceaaagac tttaggctat aagaggctga ctaaaagcaa ^ccaacctca 
24^1 t. gecaaav tc tattgtataa atctcrgetct aaaggccrgaa aaaatcagcg ctcgacacgg 

2 521 i.ctcattgtc accacccgtc ^cctaaaatc tactcagegt eggcaaagga gccatgggtt 
25S1 ctagcaacta ac-;v.t-.acctg'c tgaaat'ocga acacccaaac aacfeg^taa ca tct at teg 
2^41 aagcgastgc agactgaaga aaccttccga gacttgaaaa gfccctgcc:ta eggactagge 
2701 r.tacgecata gecgaacgag cagctcagag cgttttgata tcatgctcgct aatcgccctg 
27^1 ^tgcttcaac taacatgttg gctCgcgggc gttcacgctc agaaacaagg ttgggacaag 
2821 cactcccagg cfcaaeacagt cagaaafcega aacgsactct caacagctcg cttaggcatg
2881 gaagttxtgc ggc&fctctgg ctacacaata acaagggaag acttactcgt ggctgcaacc 
2941 ctaetagcfcc aaaatttatfc eiaeaeatggt tacgctttgg ggaaattatg aggggatcgc 
3001 fccfcagagcga teegggatet cgggaaaagc gfctggtgacc aaaggtgcct fcfctafcc&tca 
5 3061 cttkaaaaafc aaaaaacaat tactcagtgc ctgtfcataag cagcaafct&a ttatgattga 
3121 tgcctacate aeaacaaaaa efcgatfctaac aaatggtcgg tctgcefctag aaagtatatc 
3181 tgaacattafc efcfcgafccata ctattgataa taafcaiaaac cttatcccta tccaagaagt 
3241 gatgcct&tc attggtfcgga afcgaacfctga aaaaa&ttag ccfctg&atae stt&ctggta 
3301 a ^ taaac ^ c cattgtcagc aaattgstcc aag&gaacca actfcaaagct ttccfcgacgg 

10 3361 aatgttaatt ctcgttgaec ctgagcacfcg atgaatcccc taatgattvt ggtaaaaacc 
3421 attaagfcfcaa ggfcggafcaca catcstgfcca tatgatcccg gtaatgfcgag ttagcteacjt 
3481 cactaggcac cccaggcttt acactxtatg cttccggctc gcatgttgtg tggaatcgcg 
3 541 a0cggataac aatttcacac aggaaacage tatgaecatg attacgccaa gcgcgcaane 
3£0I aacectcact aaagggaaca aaagcfcggag ctccaccgcg gtggcggceg ctefcagaacfc 

15 3661 agtggatcec ecgggeatea gatfcggetat tggccattgc acacgtfcgfca fcecafcatcat 
3721 aatatgtaca ttfcatafcfcgg ctcafcgteea acatfcaccgc catgtc-gaea ttgactattg 
3781 actagttatt aafcagtaafce aatfeaegggg tcafcfcagtfcc atagcecata tatggagttc 
3841 cgegttacat aacttaeggt aaatggeccg cctggcfcgae cgecca&cga cccccgccca 
3 301 fcfcgacgtcaa taatgacgta tgttcccata gtaaegec&a tagggacttt ccattgacg'c 
3961 caatgggtgg agtatttacg gtaaactgce eaefcfcggeag tacatcaagt gtatcatatg 
4021 ccaagtacgc ccectatfcga cgteaafcgac ggtaaatggc cegcctggca fctatgeccag 
4081 tsacatgaccfc tatgggactt tcetacttgg eagtacacct acgfc&t-Eagc catcgctact 
4141 aecatggtga tgcggct.r.tg gcagtaeatc aatgggcgtg gatagcggtt tgactcacgg 
4201 ggafcfcfceeaa geetccaccc cattgacgtc aatgggagct tgktttggea ccaaaatcaa 

25 4261 cgggaetttc caaaatgtcg taacaactcc gccecatxga cgcaaafcggg ogqtaggcgt 
4321 gfcacgg-ggg aggtctatat aagcagagct cgtttagtga accgtcagat cgcctggaga 
4391 cgccatccac gcfcgtttfega ectccataga agacaccggg accgatccag cetcegcggc 
4441 cgggaacggc gcafcfcggaac gegg&ctccc cgtgccaaga gtgaegtaag fcaccgcctat 
4501 agactctata ggcacaeccc tctggccctt atgcatgcta tactgctctt ggcfct:ggggc 

30 4561 ctatacacec ecgcttcefce afcgctatagg tgatggfcafca gctfcagccta fcaggtgtggg 
4621 ttatfcgaeca ttafctgacca ctcccctatt ggtgscgata efcfctcc&tta ctaatccata 
4631 aeatggcfcct ttgccacaac tatctetatt ggctatatgc caatacfcctg tccttcagag 
4741 actgacacgg actctgtatt tttaeaggat ggggtcccat tctattattta caaatteaea 
4801 tatacaacaa cgccgtcccc cgtgcecgea gtttttatta aacatagcgt gggacctcca 

35 4361 cgcgaatctc gggtacgtgt cccggacatg ggctctt;ctc cggtagcggc ggagcttcca 
4S21 catccgagcc ctcggtcccat gcctceageg gctcatggtc gctcggcagc t.cat"ogcf.c4? 
438 1 taacagtgga ggccagactt aggcacagca caatgcccac caccaccagt gtgccgcaca 
5041 aggccgtggc ggtagggtat gtgtctgaaa atgagcgtgg agattgggct cgcacggctg 
5101 acgcagatgg aagacttaag gcagcggcag aagaagatgc aggcagct.ga gttgttgtat 

40 5161 fcctgataaga gteagaggta actccc^ttg cggtgctgtt aacggtggag ggcagfcgtag 
3221 tctgagcagt actcgttgct gccgcgcgcg ccaccagaca taatagctga cagactaaca 
S2S1 g&etgttcct. s:tccatgggt ctcttctgca gtcaccgtcg ggatccatgg gctccatcgg 
5341 cgcagcaagc atggaatttt gttttgatgc atteaaggag eteaaagccc accatgccaa 
5401 tgagaacatc ttctaetgec ccattgccafc catgfccagct ctagccatgg t:atacctggg 

4$ 5461 tgcaaaagac agcacc&gga cacagataaa taaggttgtt egctttgaca aacttccagg 
5521 attcggagac agtactgaag ctcagtgtgg cacatctgta aacgttcact cctcacctag 
S581 agacafccctG aaccaaatca ccaaaccaaa tgatgfcttat tcgfctcagcc ttgccagtag 
5641 actttatgct gaagagagat acccaatcct gccagaatac ttgcagtgtg tgaaggaact 
5701 gtatagagga ggcttggaac ctatcaac£t tcaaacagct gcagatcaag ccagagagct 

50 5761 cateaattcc tgggtagaaa gtcagacaaa tggaattatc agaaatgtcc ttcagccaag 
5821 ctccgtggat; tctcaaactg caatggttct ggcfcaatgcc attgtcttca aaggactgtg 
588! ggagaaaaca tttaaggatg aagacacaca agcaatgcct ttcagagega ccgagcaaga 
5941 aagcaaacct gtgcagatga r.gtaccagat tggtfctactc ag^gtggcat caatggcttc 
S001 fcgagaaaatg aagatcctgg agcttccatt tgecagtggg acaatgagca tgttggtgct 

bt> S061 gttgcctgat: gaagfcctcag gccttgagca gcctgagagfc ataatcaact ttgaaaaact 
€121 gactgaatgg accagttcta atgttatgga agagaggaag atcaaagtgt acttacctcg 
&1Q1 caitgaagatg gaggaaaaat acaacctcac atcfcgfcefcfca atggctafcgg gcattactga 
S241 cgtgtttagc tcttcagcca atctgcctgg catcccctca gcagagagcc tgaagatatc 
6301 ccaagetgtc cacgcagcac atgcagaaat caatgaagca ggcagagagg r.ggtagggtc 

60 agcagaggct ggagtggatg ccgcaagcgt ctctgaagaa tttagggctg accatecatt; 

6421 cctcttctgt. atcaagcaca tcgcaaccaa cgccgttctc ttctttggca gatgtgtttc 
6481 ccgcggccag cagatgacgc accagcagat gacgcaccag cagatgacgc accagcagat 
6541 gacgcaccag cagatgacgc aocageagat gacgcaacaa catgtatcct gaaaggctcr. 
5501 tgtggccgga ccggcctgct ggatgacgat gacaaatttg tgaaccaaca cctgcgcggc 

65 t>661 tca'cacc egg tggaagece?: ctacctagtg tgeggggaac gaggcr.rctt ctacacaccc 
€751 aagacccgcs gggaggcaga ggacctgcag gtggggcagg tggagctggg cgggggccct 
S781 ggtgcaggca gcctgcagcc cttggacetg gaggggtccc tgcagaagcg tggcattgtg 
6841 gaacaatget gcaccagcat ctgctccctc taccagcfcgg agaact:actg caactagggc 
tcafccacgat
tagfcgagggfe
tgttar.cegc 



taaagcctgg ggtgcctaat 



gcct&aaggg cgaatfcatcg cggccgctct agaccaggcg cctggatcca gatcacttct 
ggcr.aataaa agatcagagc ; tctagagatc tgtgt.gt- tgg cttcttgtgg atetgctgtg 
ccttctagtt gccagccatc tgttgtttgc 'cccfcececcg tgcefctcctfc gacccfcggaa 
ggtgccactc ccactgtqct ttccfcsat<sa aatgaggaaa ttgcafcegca ttgtctgagt 
aggtgtcats: ctattctggg gggtggggtg gggcagcaca gcaaggggga ggattgggaa 
gacaatagca ggcatgctgg ggatgegsfcg g#cccfcatgg gtaectcfcct ctccetctct 
ctctctctct ctctetetct ctctcggtae ctctctcgag ggggggcecg gtaeccaafct 
cgcccf.cuv.csg tgagv.cgtafc tacgcgcgct cactggccgt cgttttacaa cgtcgtgact 
gggaaaaeee tggcgtfcacc caacttastc gcctfcgcagc acatccccct tfccgceagefc 
ggcgtaa tag cgaagaggcc qgcsccgatc gccettecca acagttgcgc agcctgaatg 
gcgaatggaa attgtaageg ctaatatttt gttaaaattc gcgtcaaatfc tttgtfcaaat 
eagctcattfc tttaaccaat aggcegaaat eggcaaaatc ccttafcaaac caaaagaata 
gaccgagats gggttgagtg ctgctccagt ttggaacaag agtccactat taa&gaacgt 
ggactecaac gteaaaggge gaaaaactigt tftatcagggo gatggcccac tactccggga 
teatatgaea agatgtgtat ccaccttaac ttaatgafctt ttaccaaaafc cattagggga 
fcvcatcagtg etcagggfcca aegagaatfca acaixtccgtc aggaaagctt atgafcgatga 
tgtgcctaaa aactt&cxea atggecggtt atgcatatcg caacaeafcgc gaaaaaecta 
aaag&gctfcg ccgataaaaa aggccaafctt afctgctatfcfc aecgcggcfct tcfcatfegage 
ttgaaagata aataaaatag atiag^tfetta fcttgaagcta. aatcttcfctt afccgtaaaaa 
atgccctctc gggttatcaa gagggtcatt at&tfctcgcg geaataacatre atfctggtgac 
gaastaacta ageactcgtc t cc tg 1 1 fcac tccccfcgagc tcgaggggtfc aacatgaagg 
tcatcgatag caggataata atacagtaaa acgctaaacc aataatccaa afcecagceat 
cccaaactgg tagtzgaatga ctataaataa cagcaaacag taatgggcca ataacacegg 
ttgcattggt aaggctcacc aafcaafcecefc gta&agcaea tfcgctgatga ctctttgttt 
ggatagacat cactcccfcgt aatgcaggta aagcgatccc aceaccagcc aataaaatts 
aaacagggaa saccaaccaa ccfc&cagata : caaaegctaa aaaggcaaac gcactactat, 
ctgeaataaa tecgagcagt actgccgtfct tttcgcccat fct&gtggeta ttctfcecfcgc 
cacaaagget tggaatactg agtgtaaaag accaagaccc gfc&atgaaaa gccaaccatc 

ttctgtaata gcaccacacc gfcgctggatt ggctaccaat 
aaacggcafcc gtfcaaataag tgatgtafcae egatcagefct 
t aa r„ t g cgcg c c t ggcg t a a tcatggtcat agetgtttcc 
tea caa ttcc aca caaca ta cgagceggaa gcatsaagtg 
gagtgagcta actcacatca attgcgttgc gcfccactgec 
cgcttcccag togggaaaec tgtcgtgGca getgeattaa tgaateggee aacgegeggg 
gagaggeggt. ttgcgtattg ggcgctctCG cgcttcctcg ctcactgact cgctgcgctc 
ggtcgttcgg ctgeggegag eggt a t cage tcacccaaag geggtaatae ggttatccac 
agaatcaggg gataaegcas gaaagaacat gtgagca^aa ggccagcaaa aggecaggaa 
ccgtaaaaag gccgcgttcjc tggcgttttt ccataggcte cgcccccctg acgagcatca 
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa «ataccaggc 
gtttccccct ggaagctccc t eg t gege t c tec t g 1 1, c eg accctgccgc fctaceggata 
cctgtccgcc tttctccctt egggaagegt ggcgctctct. eatagstcac gctgtaggta 
tefceagtteg gfcgtaggtcg ttcgcfcccaa gctgggctgt gtgeacgaac crccccgttca 
gcccgaccgc tgegecttat ccggcaacfca tegtcttgag tccaacccgg taagacacga 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagegaggt: atgtaggcgg 
tgctacagag ttcttgaagt ggtggcctaa ccacggctac actagaagga cagtatttgg 
tatctgeget c tgc v.gaagc cag *c t acct t eggaaaaaga gttggtagct ct'cgatccgg 
caaacaascc accgccggva gcggtggttt ttttgtttgc aagcagcaga cvacgcgcag 
aaaaaaagga tctcaagaag atccttfcgat cttttatacg gggtcfcgacg ctcag^ggaa 
cgaaa<actca cgttaaggga ttttggtcat gaga t tat ca aaaaggatct tcacctagat 
ccctttaaat ta^aaatgaa gtttttaaatc aaf ccaaagt atatatgagt aaact?:ggtc 
tgacagttac caatgettaa tcagtgaggc asctatctca gcgatctgtc tatttegtte 
atccat.agtt gcctgactcc ccgtcgtgta gataactacg ataegggagg gcfctaccafcc 
tggccccagt gctgcaafcga taccgegaga cccacgccca ccggctccag atttatcagc 
aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tar.ccjgcecc 
catccagtct attaattgtt geegggaage tagagtaagt agttcgccag ttaatagttt 
gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgetcgtegt tfcggtatggc 
ttcattcagc tccggttccc aacgatcaag gcgagctaca tgatccccca fcgttgtgcaa 
aaaagcggtt agct cc tt eg gtcct begat cgttgtcaga agtaagttgg ccgcagtgtt 
atcactcatg gttatggcag cactgeataa ttctcttact gteatgecat ccgtaagatg 
cttttctgtg actggtgagt acteaaccaa gtcattctga gaatagtgta tgcggcgacc 
gagttgetet tgcccggcgt caat.acggga taataccgcg ccacatagca gaactttaaa 
agtgcccatc attggaaaac gtv.cttcggg gegaaaaetc tcaaggatct taccgr/vgut. 
10<$:-a gagatccagt tcgatgtaac ccactcrgtgc acccaactga tcttcagcat c^tf.tacvtt 
lOfe'Sl caccagcgtt tctgggtgag caaaaacagg aaggcaaaat geegcaaaaa agggaacaag 
107^1 ggegacaegg aaatgttgaa tactcatact cttccttfctt caatattatt gaagcattta 
1QS0X tcaggg-tat tgtctcafcga gcggatacafc atttgaatgt atttagaaaa ataaacaaa-c 
10861 ag^ggt:tccg cgcacstttc cccgaaaagt gecac 









SEQ ID NO: 42 (pTnMOD (CMV-CHOVg-ent-ProInsulin-synPA))



1 ctgacgcgcc ctrgtagcggc gcattaageg eggcgggtgt ggtggttacg egeagcgtga
61 ccgctacsct tgccagcgcc efcagegcecg ctcctttcgc fcfcfccttccct tccteteccg
121 ccacgttcgc cggeaccaga ttggctattg gccatfcgeat acgtegtatc catatcataa
181 tatgtacatt tatactggct catgcccaac attaccgeca tgttgacatt gattattgac
241 tagttaCbaa fcagtaafce&a fctacggggcc attagttcafc agcceatata tggagttccg
301 cgttacataa cttaeggfcaa atggcccgcc tggctgaecg cccaacgacc cccgcccatt
361 gacgteaata atgaegtafcg ttcccatagt: aacgccaata gggacfcttec afcfcgacgtca
421 atgggtggag tatttacggt aaactgccca cfctggcagta catcaagtgt atcaeatgec
481 aagtacgcac ccfcattgacg tcaatgacgg taaatggecc gcetggcafct atgcccagta
541 catgaccfcta tgggacttte ctacttggca gt&eaeefcac gtattagtca tcgctattac
601 catggtgatg eggttttggc agtacaccaa tgggegtgga tagcggtttg acteacgggg
661 attfcccaagt ctccacccca t-tgacgtcaa tgggagtttg ttttggcaec aaaatcaacg
721 ggactctcca aaafcgtegta acaacfceegc eceafctgaeg caaat:gggeg gfcaggegtgc
781 acggcgggag gtccatafcaa gcagagcfccg ttfcagfcgaac cgtcagatcg cctggagacg
841 ccatccacgc tgttttgacc fcceatagaag aeaccgggac cgatecagcc tccgcggccg
901 ggaacggtgc atCggaaegc ggattccccg tgccaagagt gacgtaagfca cegcctatag
961 actctatagg cacacccctc tggetcfcfcafc gcafcgcfcata ctgcctttgg cttggggcct
1021 atacaccccc gcttcc&cat getataggtg atggtatagc fctagcccaca ggtgcgggtt 
1081 attgaccatt afctgaccact cccctattgg tgacgatact ctccattact aatccataac 
1141 atggqtcttc gceacaaeca cctctatcgg c^atafcgcca attactctgfcc cttcagagac 
1201 tgacacggae tctgtatett tacaggatgg ggtcccattt attatttacs aactcacata 
1261 tacaacaacg ecgtcccccg fcgcccgcagt t&ttafctaaa cafcagcgc.gg gatctccaeg
1321 cgaatctcgg gfcacgtgttc cggacafcggg ccectctccg gtagcggcgg agcttecaea
1381 tecgagccct ggtcceatgc ctccagcggc tcar.ggtcgc tcggcagctc crctgctcctra
1441 aeagtggagg ccagactfcag gcacageaca acgcccacca ccaccagtgt gccgcacaag
1501 gccgcggcgg tagggtacgc gcccgaaaat gagcgtggag attgggctcg cacggctgac
1561 geagatggaa gacttaaggc agcggcagaa gaagafgcag gcagctgagt tgttgtatcc
1621 tgafcaagagfc cagaggtaac tcecgfctgeg gtgctgttaa cggtggaggg cagfcgtagtc
1681 tgagcagtac tcgttgctgc cgegcgcgcc accagacata atagctgaca gactaacaga
1741 ctgttccttt ccar.gggcct tttccgcagt caccgucgga ccatgtgfcga acttgatatc
1801 ttacatgatt ctctfctaeca attcfcgcccc gaafctacact taaaacgact csacagctta
1861 acgbtggctt gecacgcatt acttgactgt aaaactctca ctcctaccga acttggccgt
1921 aacetgcc&a ccaaagcgag aacaaaacat aacafccaaac gaatcgaccg attgctaggc
1981 a&fccgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttaectgt
2041 tcgggcaafca cgatgcecat tgtacfctgtt gactggtctg atattcgfcga gcaaaaacga
2101 cfctatggtat tgcgagcttc agtcgcacta cacggtcgtfc ctgttactct ttatgagaaa
2161 gcgfcfccccge tttcagagca atgttcaaag aaagcccatg accaatttct agecgacctt:
2221 gcgagcattc taccgagtaa caccacaccg ctcattgtca gtgatgctgg ctttaaagtg



taagtcgagt 
ccagcaactt 
ctaaaagcaa 




aagaggaaaa 
acatgacatg 
cccaatctca 
ctcgacacgg 
gccatgggtt 



I'281 ccatggtaca aatccgfctga gaagctgggt tggtactggt 

1341 gtacaatacg cagaeceagg agcggaaaac tggaaaccr.a 

'*.401 tcatctagtc actcaaagac tttaggctat aagaggctga 

2461 tgccaaattc tattgtataa atctcgctct aaaggccgaa 

2521 actcattgtc accacecgtc acccaaaatc tacrcagcgt 

2581 ctagcaacta aettacctgt tgaaattcga acacccaaac aacttgt.tas tatctattcg 

2641 aagegaaxgc agattgaaga aaccttccga gacttgaaaa gtectgccta cggactaggc 

ccacgccaca gccgaacgag cagctoagag cgttctgata tcatgctgct aatcgccccg 

s^gcttcaac taacatgttg gcttgcgggc gtfccatgctc agaaacaagg ttgggacaag 

ctaacacagt cagaaatcga aacgtactct caacagttcg cttaggcacg 

ccacacaata acaagggaag acttactcgt ggctgcaacc 

cacacatggt tacgctttgg ggaaatcatg ataacgatcc 

aagatcagag ctctagagat ctgtgtgttg gttttttgtg 

fcgccagccat ctgttgtttg cccctccccc gtgccttcct 

cccactgtcc tttcctsata aaatgaggaa actgcatcgc 

tctattctgg ggggtggggt ggggcagcac agcaaggggg 

aggcatgctg gggacgcggt gggctctatg ggtacctctc 

tctccccctc tctctctctc r.ctctcggca cccccccccc cccctctccc 






eacfcfcceagg 

gaagcctcgic 
ccactagccc 
agatcacctc 
gasccgccgt 
tgaccctgga 
at?;gtctgag 
aggatrtggga 
tctctctctc 
tctctctctc 
tgccttttat 



ggcaritctgg 
aaaatttatt 
tggctaataa 
gccttctagt. 
aggtgccact 
taggcgtcac 



;ctctctctc tcggtaccag gtgccgaaga avtgacccgg tgaccaaagg 



catcacttta aaaataaaaa 
attaatt.acg atcgacgccs acatcacaac 
ctcagaaagt acatctgaac atfcatettga 
ccctatccaa gaagtgatgc ctatcattgg 
aatacatcac tggtaaggta aacgccatxtg 



acaattactc agcgcctgtt ataagcagca 
aaaaactgac ttaacaaacg gttggtctgc 
ttatattatt gataataata aaaaccttat 
tfcggaatgaa cttgaaaaaa attagccttg 
tcagcaaatt gatceaagag aaccaaccta 



aagctttcct gacggaatgC taattctcgt tgaccctgag cactgatgaa Ccccctaatg 
afctttggfcaa aaatcattaa gttaaggtgg atacacatct tgf.catatga fceccggtaat 
S^S^St^^SC tcaceeatta ggcaccccag gcttcacact ttatgcttcc ggctcgtatg 
3901 tfcgtgtggaa ttgtgagcgg



3951 gccaagcgeg caatt&aece 
4 021 gg ; ccgcfc c t :-a gaaG tagtgg 



ataaeaattt cacacaggaa acagctatga ccatgattac 



teaetaaagg gaacaaaagc tiggagctcea ecgcggtggc 

ateccceggg catcagattg gefcafctggee attgeatacg 

4 061 tfcgtatcca v stcataatat ^Lacatttat attggefceat gtccaacatt accgccatgt 

5 4 141 tgacattgat tattg^ctag ttattaatag taatcaatta eggggtcant agttoar.agc 

•''20 1 ccatatatgg agttccgegt tacataactt aeggtaaatg gcccgcctgg ctgaccgccc 

•i251 aacgacccec gcccattgac gtcaataafr,g acgCatgtfce ceatagtaac gecaasraggg 

■J321 aetttccatt gaegtcaatg ggtggagtat. Cfjacggtaaa ccgcccacct ggcagtacat 

•0 81 caagtgtatc a t a t geoa ag v. a c g <c c c: c c: fc actgaegeca atgacggtaa atggccegce 

10 4441 tggcatfcatg eccagv.aear: gacci-.tatgg gactfctccta cttggcagta catctaegta 



4 501 t fc ag v a a i . c g c- 1 a 1 1 a c c a t 
y SSI eggtttgact caeggggatt 
4 621 tggcaccaaa ateaaeggga 
4631 atgggcggta ggcgtgcacg 



15 4 741 cagatcgcct. ggagaegcea 

4801 tccagcctcc gcggccggga 



ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag 
tccaagtcfcc caccccattg aegtcaaegg gagtfcfcgtcfc 
at t tccaaaa tgtcgt aaca actccgcccc attgaegcaa 
gtgggaggtr.c tatataagca gagctcgttt agtgaaccgt*. 



35 5 941 tcactcctca etiagagaca 

vOQl cagect cgee agtagacttt 
6 061 gr-gtgc.gaag gaactgtata 



40 



tccacgctgt tttgacctec atagaagaca ccgggaccga 
aeggtgeatt ggaacgcgga ttccccgtgc ctaagagfcgac 
4 861 gtaagtaccg cctatagact etataggcac acccctttgg ctctfcatgea tgctatactg 
4321 tttttggctt ggggectata cacqcGcgct. tcccr.atget ataggt.gatg gtatagctta 
4 981 gectataggt gtgggttatt gaceattatt gaecactcec cfcatfcggtga egataefctte 
20 5 041 catcactaat cea t a aca v.g gc tat 1 t gec aeaactatct ctattggcta tatgeeaata 

5101 ctctgfccctt qagagactga caeggactet gfcatfctteac aggatggggt cccatttatt 
5161 attcacaaafc l-oacafcafcac aacaacgccg fcecccegtgc ccgcagtttt tattaaacat 
5221 agegegggat e t c« aegega ate v egggea cgcgttccgg acatgggcfcc tuacceggta 
52 63 geggeggage ctccttcatcc gagccctggt cccatgcctc cagcggctca tggtegcteg 
25 5341 gcagctcett gctcctaaca gtggaggcea gacttaggca cagcacaatg cccaecaeca 

54 01 ccagfcgtgcc gcacaaggcc gtggcggtag ggcatgtgfcc fcgaaaatg&g cgfcggagatt 
54 51 gggctcgcac ggctgacgea gatggaagac tUciaggcagc ggcagaagaa gafcgeaggea 
5521 gccgagttgt tgtactctga taagagtcag aggtaactcc cgttgeggtg ctgttaacgg 
?«5B1 tggagggcag ■cgtagtctga gcagfcactcg ttgctgccgc gcgcgccacc agscataata 
30 )>64i gctgacagac taacagac tg 1. 1; cc 1 1 1 cca tgggtctttt ctgeagtcac cgfccgggatc 

5701 catgggctcc ateggegeag caagcatgga at -rttgtttt gatgtattca aggagctcaa 
5761 cigtccaccat gecaatgaga acatcttcta ctgccccatt gecatcatgt cagctctagc 
5821 caeggtatae ctgggtgcaa aagacagcae caggacacag ataaataagg tv.gctcgcct 
:;S6:i tgatasactt ccaggattcg gagacagtat tgaagctcag tgtggcacat etgtaaacgfc 

tcctcaacca aatcaccaaa ccaaatgatg fcttattcgtt 
atgcfcgaaga gagataccca avcctgccag aatacttgea 
gaggaggctc ggaacccatc sacttccaaa cragctgeaga 
6121 teaagecaga gagctcatca attcctgggt agaaagtcag acaaafcggaa fctatcragsaa 
6181 tgtccttcag ccaagctccg tggatcctca aactgeaatg gttctggtta aegecattgt 
6241 cttcaaagga ct9tgggaga aaacatttaa ggstgaagac acac-aagcaa tgcctttcag 
6301 agtgactgag caagaaagca aacctgtgca gatgatgtac cagattggfcfc tatttagagt 
6361 ggcatcaatg gcttctgaga aaafcgaagafc cctggagctt ccatttgcca gtgggacaat 
6421 gagcatgttg gtgetgetge ctgatgaagt ctcaggcctt gagcagcttg agagtataat 
6481 caactctgaa aaactgactg aacggaccag ttefc&atgtv, &tgQcs3o[&ci& ggaagatcaa 
6541 agfcgtactta ccccgcafcga agatggagga aaaatacaac ctcacatctg tcttaatggc 
6601 tatgggcatt actgacgtgt ttagcccctc agecaatctg tctggcatct cctcagcaga 
6661 gagectgaag atatctcaag ctgtecatgc ageacatgea gaaar.caatg aag^aggcag 
6721 agaggtggta gggtcagcag aggctggagt ggatgetgea agegtctctg aagaacttag 
6781 ggctgaccat ccattcctct tctgtatcaa gcacatcgea accaacgccg tfcctctfcctt 
6841 f.ggcagatgt gtttcccgcg gceagcagat gaegcaecag cagatgaege accagcagat 
6901 gacgcaccag cagatgaege accagcagat gacgcaccag cagatgaege aacaacatgt 
6961 atcctgaaag gctcttgtgg etggatvcggc ctgctggatg acgatgacaa afcttgtgaac 
7021 caacacctgt gcggctcaca cctggtggaa gctctctacc tagtgtgcgg ggaacgaggc 
7081 ttcttctaca cacccaagac ccgeegggag gcagaggacc tgeaggtggg gcaggtggag 
7141 ctgggcgggg gccctggtgc aggcagectg cagcccttgg ccccggaggg gtccctgcag 
7201 aagcgtggca ttgtggaaca atgctgtacc agcatccgct ccctctacca gctggagaac 
7261 tactge-aact agggegecta aagggcgaat tatc^cggcc gctctagacc aggegectgg 
7321 atccagatca cttctggcta ataaaagatc agagctctag agatctgtgt gfctggttttt 
7381 tgtggat:ct.g ctgtgcgttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct 
7441 tccttgaccc tggaaggfcgc cactcccact gtcctttcct aataaaatga ggaaattgea 
7501 tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca gcacagcaag 
7561 ggggaggatt g«gaagacaa tagcaggcat gctggggatg cggcgggctc tatgggfcacc 
7621 tci:ccci:ctc tctctctctc tctctctctc tctetctccc ggtacrctctc ctcgaggggg 
7681 ggcccggtac ccaattcgcc ctatagtgag tegtattacg cgcgctcact ggccgtcgtt 
7741 ttacaacgtc gtgactggga aaaccctggc gttacccaac ttaatcgect tgcagcacat 
7801 ccccccr-rcg ccagctggcg fxaa^agcgaa gaggcccgca ccgatcgccc ctcccaacag 
7861 ttgcgcagcc 'ogaatggega atggaaatt.g taagcgttaa tattttgtta aaattcgcgt 
7921 taaatttttg ttaaatcagc tcatttttta accaataggc egaaategge aaaacccctt: 
7981 
8041 
8101 
8161 
8221 
8281 
8341 
8401 
8461 
8521 
8581 
8641 
8701 
8761 
8821 
8881 
8941 
9001 
9061 
9121 
9181 
9241 
9301 
9361 
9421 
9481 
9541 
9601 
9661 
9721 
9781 
9841 
9901 
9961 
10021 
10081 
10141 
10201 
10261 
10321 
10381 
10441 
10501 
10561 
10621 
10681 
10741 
10801 
10861 
10921 
10981 
11041 
11101 
11161 
11221 



atiaaateaaa 
cactattaaa 
gcccactacfc 
c&aaatcafct 
aagetc.acga 
acatgegaaa 
cggcttttta 
ttctttateg 
aaeateattt 
ggggttaaca 
atceaaatee 
gggccaataa 
tgatgactct 
ccagecaata 
gcaaatgc&c 
tggctafctct 
tgaaaagcca 
cggatcggcc 
gtafcaeegar. 
gg 1: c a t age t 
ccggaagcat 
cgtfcgcgetc 

tCggCCaaeg 

cfcgaetcgct 
t&atacggtt 
agca&aaggc 
ccectgacga 
^afcaaagata 
tgecgeefcac 
gcteaegctg 

3^C!CCggtaag 
Cg agg £ ® ^* 9 t 

gaaggacagt 
gfcagetcttg 
agcagattac 
ctgacgccea 
ggafccfctcac 
atgagtaaac 
tctgtctatt 

caacttxace 
cgceagttaa 
cgtcgtttgg 
ctcccatgtt 
agtcggccgc 
tgecatccgix 
agtgtatgeg 
atageagaae 
ggatcttaec 
cacseatcttt 
caaaaaaggg 
afcfcattgaag 
agaaaaataa 



agaafcagacc 
gaaegtggac 
ccgggateat 
aggggattca 
tga tgatgtg 
aaeefcaaaag 
ctgagcttga 
taaaaaatge 
ggegacgaaa 
cgaaggtcafc 
agecatccca 
caccggttgc 
ttgtttggafc 
a&atta&aac 

tectgccaca 
accatcacgc 
accaatgege 
cagetcttgfc 
gttfcectgfcg 
a&agtgta&a 
aofcgcccgefc 
egcggggaga 
gcgcteggtc 
atccacagaa 
e&gga&ccgt 
gcafccacaaa 
eeaggcgtfcfc 
cggataectg 
taggtatete 
egtteagcce 
acacgacfcta 
aggcggtgct: 
atttggtafcc 
atccggc&aa 
gcgcagaaaa 
gcggaacgaa 
ct&oatcct t 
tt;ggtctgac 
tcgtt.cas:ce 
aecatctggc 
aceageaafca 
cgcctceatc 
tagfcttgegc 
tatggctfcca 
g tgcaaaaaa 
agegttatca 
aagatgettt 
gcgaccgagt 
tttaaaagtg 
gctgttgaga 
tacttteacc 
aataagggcg 
cattfcatcag 
acaaataggg 



gagatagggt 
tccaacgtca 
atgacaagat 
tcagtgctca 
ctfcaaaaacc 
agettgccga 
aagataaata 
eetcttgggt 
taactaatrca 
egatageagg 
aattggtagt 
attggcaagg 
agae&tcact 
agggaaaact 
aar.aaatccg 
aaggcctgga 
tat tea teat 
cgaaa~aata 
tcccttfcagt 
tgaaattgtr. 
gcctggggtg 
ttccagtegg 
ggcggtttgc 
gfctcggctgc 
tcagggcata 
aaaaaggecg 
aatcgacget 
ceccetggaa 
tccgcctttc 
agtxcggtgt 
gaccgctgcg 
tcgccactgg 
acagagttct 
tgcgctctgc 
caaaccaccg 
aaaggafcetc 
aactcaegtt 
tfcaaattaaa 
agfctacca&c 
ar.agttgcct; 
cccagtgctg 
aaecagecag 
essgtctatta 
aacgttgtcg 
tfccagctecg 
geggttagcc 
ctcatggtta 
tctgtgactg 
tgetcttgee 
ctca"cattg 
tccagttcga 
agcgttfcctg 
acaeggaaat: 
ggttattgtc 
gttccgcgca 



tgagtgttgt 
aagggcgaaa 
gtg fcat oca c 
gggtcaacga 
tactcaatgg 
taaaaaaggc 
aaatagata9 
tatcaagagg 
cttgtetcct 
ataataatac 
gaatgattat 
etcaccaata 
ccctgtaatg 
aaccraacctt 
agcagtactg 
atactgagtg 
cacgatttct 
accaacaaat 

atccgctcac 
cctaatgagt 
gaaacct0tc 
gtattgggcg 
gecgageggt 
aegcaggaaa 
cgttgctggc 
caagtcagag 
gctccctcgt 
ceccttcggg 
aggtegtteg 
ccttatccgg 
cagcagccac 
tgaagtggtg 
tgaagccagt 
ctggtagcgg 
aagaagatcc 
aagggatttt 
aatgaagttt 
getxa&fccag 
gactccccgt 
caaugatacc 
ceggaaggge 
attgttgccg 
ccrat tgetae 
gttcccaacg 
ccrttcggtcc 
tggcagcact 
gtgagtactc 
eggegteaat 
gaaaacgttc 
t0taacccac 
ggfcgagcaaa 
gttgaatact 
teatgagegg 
cat ttccccg 



tccagtttgg 
aacegtctat 
cttaacttaa 
gaattaacat 
ctggttacgc 
caatctattg 
gttttatttg 
gtcattatat 
gtttacfceec 
agtaaaaege 
aaataacagc 
atccctgtaa 
caggtaaagc 
cagatataaa 
ccgttttttc 
taaaagacca 
gtaatagcac 
ggcatcgt z& 

aatcccacac 
gagefcaaetc 
gtgccagctg 
ctctfcccgct 
atcagctcac 
gaaeatgtga 
gtt t fctccat 
gtggcgaaac 
gcgctctcct 
aagcgtggcg 
ctccaagctg 
taaetatcgt 
tggtaacagg 
gcctaactac 
taccttegga 
fcggtttttfcfc 
tctgatcttt 
ggtcatgaga 
caaatcaatc 
tgaggcacct 
cgtgt'agata 
gcgagaccca 
c^agegca^a 
ggaagctaga 
aggcategtg 
ateaaggega 
tecgategtt 
gcataatcct 
aaccaagtca 
acgagataat 

^^egssgega 

tcgtgcaccc 
aacaggaagg 
catactcttc 
atacatatfct 
aaaagtgcca 



aacaagagtc 
cagggegatg 
tgatfctttac 
tcegtcagga 
atategcaat 
ctatttaccg 
aagctaaatc 
ttcgcggaat 
ctgagcttga 
taaaccaata 
aaacagtaat 
agcaccttgc 
gatcccacca 
cgctaaaaag 
gcccs*c£.cag 
agaccegtaa 
cacacegtgc 
aataagegat 
gegtaatcat 
aacatacgag 
acattaafctg 
cattaatgaa 
tcctcgctca 
teaaaggegg 
geaaaaggee 
aggctccgcc 
ccgacaggac 
gccccgac.ee 
ctttefceata 
ggctgtgtgc 
cttgagr.cca 
atcagcagag 
ggctacacta 
aaaagagctg 
gtttgeaage 
tetaeggggt 
ttatcaaaaa 
caaagcatat 
atetcagega 
actacgatac 
cgc tea cegg 
agtggtcctg 
gtaagtagtt 
gtgtca-^gcc 
gctacatgat 
gtcagaagta 
cttacCgtca 
ttctgagaac 
accgcgccac 
aaactctcaa 
aactgatctt 
caaaatgecg 
ctfcttccaat 
gaatgtattt 
c 



SEQ ID NO: 43 (pTnMOD (Chicken OVep+OVg'+ENT+proins+syn polyA)) 



1 ctgacgcgcc ctgtagcggc geactaageg cggcgggtgt ggtggttacg cgcagcgtga 
61 ccgctscact cgccagcgcc ctagcgcccg ctcctttcgc tctcttcect tcctttcccg 
121 ccacgttcgc eggcatcaga ttggctattg gecattgeat acgttgtatc catatcataa 
181 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 
241 cagttattaa tagtaatcaa fctacggggtc attagtr.cat agcccacata rggagctccg 
301 cgttacacaa ctfcacggtaa afcggcccgcc tggctgaccg cccaacgacc ccogcccatt 
361 gaegtcaata atgacgtatg ttcccatagt aacgecaata gggactttcc attgaegtea 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 






atgggtggag 
aagtacgcec 
eatgacctta 
catggtgatg 
at tt ecaagt 
ggactttcca 
aeggtgggag 
ecatccacgc 
gga acggtgc 
actctafcagg 
atacaccccc 
attgaccatt 
afcggetefctfc 
tgacacggae 
tacaaeaacg 
cgaafcctcgg 
tccgagcect 
aeagtggagg 
gccg tggegg 
gcagatggaa 
tgataagagt 
tgagcagtac 
ctgfcteettt 
ttacacgact 
acgfc tggctt 
aacctgccaa 

tcgggcaata 
cttatggtat 
gcgt tcccge 
gcgageattc 
ccatggbata 
gtacaatatg 
tcatctagtc 
tgeeaaattc 
aetcattgtc 
ctaecaacta 
aagcgaatgc 
etacgecata 
atgctteaac 
caettccagg 
gaagttttgc 
ctactagctc 
tctagagega 
ctttaaaaat 
tgcctacatc 
tgaacattat 
gatgecfcatc 
aggtaaacgc 
aatgttaatt 
attaagfctaa 
cattaggcac 
agcegafcaac 
aaecctcact 
agtggatccc 
extgacctga 
cagagagaaa 
attcatctgt 
atgaaaagge 
tgctccttec 
gtaggtfctta 
ttttggataa 
tggt ct: aggg 
ctgaectfctt 
ttgcacagcc 
geaagaagat 
ggtagagata 
agactttctt 



catttacggt 
cctaxtgaeg 
tgggacfctfce 
cggtttfcggc 
ctceaeccca 
aaacgtegta 
gtctatataa: 
tgtcttgacc 
afctggaacgc 
cacacccctt 
gcttccct&t 
attgaccaefc : 
gccaeaacfca 
tctgtatttt 
ccgtcecccsg 
gtacgtgtfcc 
ggtcccatgc 
ccagact t:ag 
tagggtatgt 

gaettaagge 
cagaggtaae 
tcgttgctgc 
ccatgggtet 
ctctttaeca 
gceacgcatt 
ceaaagcgag 
tceacaaaga 
cgatgceeat 
tgcgagctte 
1 1 1 cagagca 
tacegagtaa 
aatccgttga 



aaaetgccca 
t caa tgaegg 
ctaetcggca 
agtacateaa 



acaactcegc 
geagagctcg 
tccatagasg 
ggat tceceg 
tggctcctat 
get. at aggtg 
ceectattqa 
tctetatfcgg 
tacaggatgg 
tgcecgeagt 
cggacatggg 
je 



aetcaaagse 
tattgtacaa 
accacccgfcc 
ac tta.ee. t'gt- 
agat tgaaga 
gecgaacgag 
taaeatgttg 
ctaacacagt 
ggcatfcctgg 
a aa a 1 1 a tl* 
teegggatet 
aaaaaacaat 
acaacaaaaa 
ct tgattata 
afctggttgga 
cat fcgt cage 
etegttgaec 
ggtggataca 



aatttcacac 
aaaggg&aca 
ccgggctgca: 
tacccgattt 
eeatcaetga 
gaectgagca 

taatgtcaaa 
gtgattggat 
aaagtgcttt 
acagacccac 
cfctgggacaa 
gtgctgggca 
tgttgectae 
aeatttacfcg 
agtggcfcgaa 



gcacagcaca 
gtctgaaaat: 
ageggcagaa 
tcccaetaea 
cgcgcgcgc c 
c.ttctgcagt 
actctgcccc 
acttgactgt 
aacaaaacat 
gcgactcgct 
tgtacttget 
agvegsacta 
atgttcaaag 
caccacaccg 
gaagctgggc 
ageggaaaac 
tttaggctat 
atctcgctct 
: : acctaaaatc 
cgaaattcga 
'■■ aacct fcc C0 a 
ca^ctcaaag 
gettgeggge 
cagaaatcga 
ctacacaata 
cacacafcggt 
egggaaaage 
tacfccagtgc 
ctgatttaac 
ttattgacaa 
acgaacttga 
aaattgatcc 
ctgagcactg 
catc^tgtca 
acactttatg 
aggaaacagc 
aaagcfcggag 
gaaaaatgee 
frctccaaaqt 
tggetacagc 
aaatgattta 

K ^ 1« \ T ;i <^ ^ r>, ^ 

attgtagtgg 
aagaggct 
tacaact *c tc 
aatgaaaf.gc 
gcattgtcaa 
gggcaa tcca 
tccctccaga 
ggaagcacat 
atagaagcaa 
ctfcggcagta 
taaatggecc. 
guacatctac 
^9S9C'gtg<?a 
tgggagcttg 
cccattgacg 
tttagtgaac 
acac'cgggac 
tgccaagagt 
geatgetat a 
atggtatagc 
tgaqgatact 
etatatgeca 
ggtcccat t fc 
ttttattaaa 
ctcttctccg 
teatggtege 
atgcccacca 
gagcgtggag 



gcgctgctaa 

accagacata 

caccgt egga 

gaattacact 

aaaaetctca 

aacatcaaac 

gtav.ac.cgtt 

gactggcctg 

caeggtagtt 

aaagctcat.g 

ct cat tgt era 

fcggtactggt 

fcggaaacc t a 

aagaggctga 

aaaggccgaa 

tactcagegt 

acacccaaac 

gacttgaaaa 

cgttttgata 

gticeacgctc 

aaegtactet 

acaagggaag 

tacgetttgg 

gcuggtgacc 

ctgttataag 

aaatggttgg 

taataaaaac 

aaaaaattag 

assgagaacca 

acgaatcccc 

tafcgatcccg 

cttccggctc 

fc&tgaccatg 

ctccaccgcg 

^9S^99^^ta 

S9B9^aacaa 

accaaggtat 

tctctccatg 

gcaacaaaga 

c^aagaggag 

gacctgt^ag 

aggtctocga 

ctggcatagg 

acaatgtgtg 

t tgccaccua 

aagcttccgc 

cca tea teat 

aagacgtga t 



catcaagtgt 
gcccggcatt 
gtattagtca 
tagccgtttg 
ttttggcacc 
caaatgggcg 
egtcagateg 
cgatccagcc 
gaegtaagta 
ctgtttttgg 
ttagcctcJta 
ttccatfcaet 
atactctgtc 
attatttaca 
catagcgtgg 
gtageggegg 
tcggcagctc 
ccaccagtgt 

attgggctcg 

g cage t gag t 

cggtggaggg 
atagctgaca 
ccatgtgcga 
taaaacgact 
ctettaccga 
gaatcgaccg 
ggcatgetag 
atattcgtga 
ctgttacfcct 
accaatttct 
gtgatgctgg 
taagtcgagt 
tcagcaacfo 
ctaaaagcaa 
aaaatccsgcg 
eggcaaagga 
aacttgttaa 
gtcctgccta 
teatgetget 
agaaacaagg 
caacagtfc eg 
aettactcgt 
ggaaattatg 
aaaggtgcct 
cagcaattaa 
tetgeettag 
cttatcccta 
ecttgaatac 
aettaaagct 
taatgatttt 
gtaatgtgag 
gtatgt tgtg 
attacgecaa 

tgaacte^ea 

gcaatggcaa 
aatggt tget 
caaacagaga 
aacaaaatct 
ctcacctgga 
gtctttattc 
aaagggcagc 
acaaaactat 
tcecaggtaa 
agactgacat 
aaaaagcagg 
taaaaacaaa 



atcatatgec 
acgcccagta 
tegctattae 
actcaegggg 
aaaafceaacg 
gtaggcgtgt 
cctggagaeg 
tccgcggecg 
ccgcctstag 
ettggggect 
99tgtgggtt 
aat ccataac 
cttcagagac 
aattcacata 
gatctecacg 
agcttccaea 
cttgetccta 
gccgcacaag 
cacggct.gae 
tgttgtattc 
cagtgtagtc 
gactaacaga 
actcgatatt 
caacagctta 
acttggecgt 
attgttaggt 
ctt tatctgt 
gcaaaaacga 
ttatgagaaa 
agccgacctt 
ctttaaagtg 
aagaggaaaa 
aeatgatatg 
tecaatCvca 
ctcgacacgg 
gccatgggtt 
tatctattcg 
eggactaggc 
aatcgccetg 
ttgggacaag 
cttaggcatg 
ggetgeaacc 
aggggatege 
tttatcatca 
ttatgatcga 
aaagtatatt 
tccaagaagt 
attaetggta 
ttcctgaegg 
ggtaaaaatc 
ttagetcact 
tggaattgfcg 
gegegcaatt 
ctctagaact 
trccaaaggag 



caaaacagct 
tccattcgae 
tctttccetc 
aeaatcaatg 
caagv.tctga 
ct teat a tec 
atgagactgt 
ag*jgccttag 
t tgtactget: 
crct tccaaet 
gcatttcaca 
caagattutc 
atgaaaeaaa 



aaaaatcagt 
tcttgtctta 
fccacaaaagg 
taggaaagta 
atgttgtact 
ttctttactg 
tatafcttgct 
ttcacaatxc 
cactgcaagfc 
ctatgttttg 
gatagattta 
agfctggctce 
tcacgtcttg 
agaaggatca 
aaefcaataat 
aaagatceca 

C t, t C uCtCC^ 

ecagaattaa 
aatctaaccc 
acetgtgggt 
ataeagctag 
ccatgggctc 
aa^tccacca 
ccatggtaca 
ttgataaact 
ttcactctkc 
eeagectegc 
agtgcgfcgaa 
atcaagccag 
atgtecxtca 
tct fccaaagg 
gagtgactga 
tggcatcaafc 
egagcatgtt 
tcaacttfcga 
aagtgtactt 
ctatgggcat 
agagcctgaa 
gagaggtggfc 
gggctgaeca 
£tggcagafcg 
atgacgcaec 
gtatcctgaa 
aecaaeacct 
gcteefctcta 

agaagcgtgg 
act&ctgcaa 
ggatceagat 
fc£tgt.ggatC 
ctccct.cgac 
catcgcaccg 

^ggggg&gg^ 

CCtC^»CtCtC 

irctacaaegt 
tccccctttc 
gttgcgcagc 
ttaaattttt 

ecac^attaa 
ggcccactac 
ccaa&atcat 
aaagcttatg 
fcacatgcgaa 
gcggcttttt 
ettctttatc 
taacate&tt 



tgatacctgt 
agtcetcaga 
aaggagagaa 
attctgctta 
tttttccccc 
t ktgtcaatt 
attgtatatfc 
ctctgtcatc 
twijt at cat a 
ctgtatcctc 
aatatc.ccag 
fcfccacafcgca 
tcctcactta 
aatgaaacag 
t.gctaat fcat 
ttatctggtt 
atecaacagt. 
aaactaaxac 
aatcecatta 
gggtcacaat 
aaagcc.gtat 
catcggcgca 
tgccaafcgag 
cctgggtgca 
tccaggatce 
aeut&gagac 
cagcagaect 
ggaactgtat 
agagcfccatc 
gccaagrctcc 
actgtgggag 
gcaagaaagc 
ggctfccfcgag 
ggtgctgtcg 
aaaactgacfc 
acctcgcatg 
taetgacgtg 
gatacctcaa 
agggtcagca 
tccatfcectc 
tgt&tctccg 
agcagatgac 
aggctcttgt 
gtgcggctca 
eacacccaag 
gggccctggt 
c a t t g tgga a 
ctagggcgcc 
cacttcfcggc 
tgctgtgect. 
cetggaaggt 
tctgagcagg 
ctgggaagac 
tctctiecetc 
cccaattcgc 
egtgactggg 
gccagctggc 
cfcgaatggcg 
ge.eaaat.cag 
aagaafcagac 
agaacgfcgga 
tccgggatca 
taggggat tc 
afcgatgacgfc 
aaacetiaaaa 
attgagcttg 
gtaaaaaatg 

tggtgacgaa 



ggf.gr.agac a 
et^ggcaagg 
aeaaaagaaa 
acagagattg 
atttttaaat 
ctactatttc 
atgafcfcgtce 
tgcc&ggcea 
aacacatfctg 
aga&aaaa&g 
etataggaaa 
tgcttctfcta 
tgtcctgcct 
acttctggtc 
gttttccatc 
gtaactgaag 
cctgstggat 
ttgececcca 
aatgatfcfcet 
tcaggctxata 
tgccttcagc 
gcaagcaf gg 
aacae.cttcfc 
aaagacagca 
ggagacagta 
acccccaacc 
tacgctgaag 
agaggaggc *c 
aat cccfcgcfg 
gtggattctc 
aaaacattta 
aaacccgtgc 
aaaatgaaga 
cccgacgaag 
gaafcggacca 
aagatggagg 
tttagctct.t 
gctgtccacg 

g^gg<^^gg^g 

t tctgtatca 
cggccagcag 
gcaccagcag 
§g<^tggatcg 
cacctggtgg 
aoccgccggg 
gcaggcagcc 
caatgctgta 
t aaagggcga 
taataaaaga 
cc?:agtf.gcc 
gccactccca 
tgtcatf.cta 
aar.agcaggc 
tctctctctc 
cc^atagtga 
aaaaccctgg 
gtaatagcga 
aatggaaatt 
cfccattcttt 
cgagataggg 
ctccaacgto 
targacaaga 
atcagcgctc 
gcttaaaaac 
gagctegccg 
aaagataaat 
ccctcttggg 
ataacxaagc 



tccagcaaaa 

agaatgtaga 

atggcactga 

cagtgatctc 

caaacagtgc 

aatacagaac 

ctcgaaccat 

ttaagctatt 

aaatxtcgagta 

tttgttataa 

gaaagtgcs^t: 

tttctcatat 

agGatggctc 

tgctactaca 

tctaaggtfcc 

ctcaatggaa 

tagcagaaca 

ctcaatccaa 

acggcgtcaa 

s:attccccag 

actcaagctc 

aattttgttt 

actgiccccat 

c c agg a cac a 

ttgaagctca 

9aat.caccaa 

agagataccc 

tggaacGtat 

tagaaagtca 

aaactgcaat 

aggatgaaga 

agatgatgta 

tcc&ggagcC 

tctcaggcct 

£jt.fccfcaa£gt: 

aaaaatacaa 

cagccaatct 

cagcacatgc 

tggafcgctgc 

agcacatcgc 

atgacgcacc 

atgacgcacc 

gcccgctgga 

aagctctcta 

aggcagagga 

tgcagccctt 

ccagcacctg 

attatcgcgg 

Ccagagctct 

agccafcctgt- 

ctgtcctttc 

rcctgggggg 

at-gcr.gggga 

tctcccCctc 

gtcgtattac 

cgtitacccaa 

agaggcccgc 

gtaagcgtta 

aaccaatagg 

tugagtgttg 

aaagggcgaa 

cgtgtatcca 

agggtcaacg 

ctactcaatg 

aeaaaaaagg 

aaaacagata 

ttatcaagag 

acttgtctcc 



aaatat tatc 
tttctacagt 
cfcaaacttCa 
tatgtatgtc 
tctacagagg 
aafcagct cct 
gaacactcct 
catggaagat 
ttgttttgca 
agcattcaca 
etgGtiettca 
tttgtcaaga 
agatgcacgc 
accatagtaa 
ccacat $ 1 1 f. 
Catgagcaat 
ggcagaaaac 
aatggaccta 
aggtcaaact 
ggctcagcca 
aaaagacaac 
tgatgtattc 
tgccatcatg 
gataaataag 

g^g-ggcaca 

accaaatgat 
aa'ccctgcca 
caactttcaa 
gacaaat^ga 
gattctggtt 
cacacaagca 
ccagat tggt 
cccatttgce 
tgagcagctt. 
tatggaagag 
cctcacatct 
gcctggcaKc 
agaaaccaat 
aagcgfcctc t 
aaccaacgcc 
agcagatgac 
agcagatgac 
cgacgatgac 
cctagtgtgc 
cctgcaggtg 
i^gccctggag 
ctecetetac 
ccgctctaga 
agagatctgt 
cgtttgcccc 
ctaataaaat 

^ggggtgggg 

tgcggtgggc 
ccggtacctc 
gcgcgctcac 
cr.caatcgcc 
accgatcgcc 
atattttgtc 
ccgaaatcgg 
ctccagt^tg 
aaaccgtcca 
cct. taact fca 
agaattaaca 
gctggttatg 
ccaatt tatc 
ggttittatxt 
ggtcattata 
tgtttactcc 



atatatgtt c 
gctagtcgta 
ctgaagaatt 
tcagaatggc 
ataactgaaa 
ccagctgaat 
cc"tgaggaa 
ttgtatggag 
cccataaaaa 
ctctagtctc 
aaataatagg 
cgtagataca 
taagcacacc 
ctgt.tv.tct.?: 
attteccagt- 
acattgt tac 
tcgaaactaa 
tctgaaggga 
gt.ggatcaac 
tcagagctca 
aaggagct ca 
tcagctctag 
gctgttcgcc 
tctgtaaacg 
gf ttafccgt. 
gaatacttgc 
acagctgcag 
attatcagaa 
aatgccat tg 
atgcctx:tca 
ttatttagag 
agtgggacaa 
gag ag ear: as 
agcaagat ca 
gtcttaatgg 
ccctcagcag 
gaagcaggca 
gaagaattta 
gfcfcctcttct: 
gcaccagcag 
gcaacaacat 
aaatttgtga 
ggggaaccag 

gggeaggtgg 
gggtcccegc 

cagctggaga 
ccaggcgccc 
gtgttggctt 
tcccccgtgc 
gaggaaac eg 
cagcacagca 
tetaegggta 
tctcgagggg 
tggecgtcgt 
ccgcagcaca 

aaaatccgcg 
caaaatccct 
gaacaagagt 
teagggegat 
acgattttta 
ttcegtcagg 
catategcaa 
gctatctacc 
gaagctaaat 
tttcgcggaa 
cctgagcttg 
8581 aggggttaae afcgaaggtca tcgatagcag gafcaataata cagtaaaacg ctaaaccaat; 
8641 aatccaaatc cagccatccc aaatfcggtag tgaatgatta taaataacag eaaacesgtaa 
8701 tgggccaata aeaccggtcg : cattggfcaag gctc&ccaat aatcectgta aagcaccttrg 
8761 ctgatgactc tttgtttgga tagacatcac tccctgtaafc gcaggtaaag cgafccccacc 
8821 accagccaat aaaattaaaa cagggaaaac fcaaccaacct tcagatataa acgctaaaaa 
8881 ggcaaatgca etactafceyg caataaatcc eg age ag tact gccgttefcCfc cgcecattta 
8941 gtggctattc tfccctgccac aaaggcttgg aatactgagt gtaaaagacc aagacccgfca 
9001 atgaaaagcc aacc&fccafcg ctatfccatca tcacgatttc tgcaatagca ccacaccgtg 
9061 ctggattggc ratcaacgcg ctgaaataar. aatcaaeaaa tggcatcgtt aaafcaagtga 
9121 tgtstaccga tcagcttttg ttccctfctag tgagggfctaa ttgegcgctfc ggcgt.aatca 
9181 tggtcatagc tgttfcccfcgt gtgaaatt'gt fcatccgctca caafctccaca caacatacga 
9241 gccggaagca taaagtgtaa agcctggggt gcctaatgag fcgagctaact cscattaatc 
9301 gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt cgfcgccagct gcattaatga 
9361 atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 
9421 actgactcgc tgcgctcggfc cgttcggct-9 cggcgagcgg tatcagctca ctcaaaggcg 
9481 gfcaatacggt VatccACaga ateaggggat aacgcaggaa agaacaEgtg agcaaaaggc 
9541 cagcaaaagg ccaggaaccg taaaaaggcc gcgtcgctgg ogtttttcca taggctccgc 
9601 ccccctgacg agcatc&caa aaatcgacgc teaagtcaga ggtggcgaaa Gccgacagga 
9661 cfcataaagst accaggcgtt cccccctgga agececetcg tgcgct.cr.cc tigtcopgacc 
9721 ctgccgctta ccggasacct gtecgcctfct ctcccfctcgg gaagcgtggc gcttfcctcafc 
9781 agctcacgct gtaggtatct cagtfccggtg taggtcgttc gctccaagct gggctgtgtg 
9841 cacgaacccc ccgtteagcc cgacegccgc gccttatccg gtaactafccg tcttgagtec 
9901 aacecggtas gacacgacfct ategccactg gcagcagcca ctggtaacag gattagcaga 
9961 cjcgagqtatg taggcggtgc eacagagtfcc ttgaagfcggt ggcccaseta eggcfcaeact 
10021 agaaggacag tatttggtat ctgcgctcfcg ctgaagecag ttaccttcgg aaaaagagtt 
10081 ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggttttte tgt^tgcaag 
10141 eagcagafcta cgcgcags&a aaaaggatet caagaagatc ctfctgatctt ttctacgggg 
10201 cccgacgctc agtggaacga aaactcaegt taagggattt tggtcatgag attatcaaaa 
10261 aggatettca eetag&tcct ttcaaatxaa aaatgaag^c ttaaatcaax ctaaagtaxa 
10321 t «9« 3t «, octets* cgtt.cc. tgctLatca gtgaggcacc eatctLscg 
10381 atctgtcfcat ttcgtccatc catagttgcc fcgactccccg tcgtgtagat aactacgata 
10441 cgggagggct. caccanctgg ccccag^gec gcaar.gatac cgcgagaccc acgctcaccg 
10501 gctccagatt tatcagcaafc aaascagcca gccggaaggg ccgagcgcag aagtggtccc 
10561 gcaactttat ccgcctccat ccac^tctafct aattattgcc gggaagcrag agtaagtagt 
10621 tcgccagtta atkgtttgcg caaigttvjtt gccafctgcca caggcatcgt ggtgfccacgc 
10681 tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga 
10741 tcccccatgt cgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt 
10801 a&gtLggccg cagtgctatc actcatggtt atggcagcac tgcataattc tettactgtc 
10861 atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc actctgagaa 
10921 tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taecgcgcca 
10981 catagcagaa cfcttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca 
11041 aggafcctfcac cgetgtugag atccagtccg atgtaaccca ctcgtgcacc caactgatct 
11101 t.cagcatctt ttsctctcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc 
11161 gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa 
11221 tattattgaa gc«cttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt 
11281 tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc ac 